T5-base model creates spelling mistakes in summary

Hi,

I am using the T5-base model for abstractive summarization. The results are good, but the generated summaries contain newly introduced spelling mistakes that were not present in the input text.
Can anyone tell me why these spelling mistakes are occurring and how I can solve this?

I think it’s due to your minimum output length setting. For example, if you have forced the model to generate a sequence of at least 50 tokens, but the natural prediction length is only 40 tokens, I think it will start generating random tokens just to reach the 50-token minimum.

Hi Zack, hope you are doing well!
Thank you for your reply.

Actually, it is not generating random tokens; it is misspelling them.

For example, the word “productive” in the input text appears as “priductive” in the summary.