Has anyone achieved a speed-up with this model?
#3 opened 2 months ago
by
RonanMcGovern

Add text-generation pipeline tag and MIT license
#2 opened 3 months ago
by
nielsr

Is this MTP head just for predicting one token ahead?
#1 opened 3 months ago
by
RonanMcGovern
