Currently tail-free sampling is what's used by defaults. Some people prefer top-k and others. We could add such options to generation settings.