Documents have multiple sentences, so how do you deal with that? Do you split the text into sentences and then concatenate the final embeddings for each sentence, or do you remove all punctuation marks so the text won't have any [SEP] tokens?
Thank you for your issue.
For document classification, we do not split the text into sentences (except in the hierarchical methods).
We do not remove punctuation marks. We treat the whole document as one long sentence.
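To illustrate the reply above, here is a minimal sketch of encoding a whole document as a single segment. It is not the repository's code: `encode_document` and its regex tokenizer are hypothetical stand-ins for a real WordPiece tokenizer, and the 512-token limit is assumed from standard BERT. The point it shows is that punctuation stays in the input as ordinary tokens, and only one [SEP] is appended at the end.

```python
import re


def encode_document(text, max_len=512):
    """Encode a whole document as one BERT-style segment (toy sketch).

    Hypothetical helper: the regex below only imitates a tokenizer by
    splitting words and punctuation; a real model would use WordPiece.
    """
    # Punctuation marks are kept as ordinary tokens, not turned into [SEP].
    tokens = re.findall(r"\w+|[^\w\s]", text.lower())
    # Reserve two positions for the [CLS] and [SEP] special tokens.
    tokens = tokens[: max_len - 2]
    tokens = ["[CLS]"] + tokens + ["[SEP]"]
    # A single segment, so every position gets segment id 0.
    segment_ids = [0] * len(tokens)
    return tokens, segment_ids


tokens, segment_ids = encode_document("First sentence. Second one!")
print(tokens)
```

Sentence boundaries inside the document therefore do not change the input layout; only documents longer than the length limit need truncation or a hierarchical approach.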
Hi, could you tell me how to code the hierarchical methods when documents have different numbers of sentences (variable-length inputs)?
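One common way to handle this, offered here as a sketch and not confirmed by the repository authors, is to split each document into fixed-size chunks, pad the per-document list of chunk vectors to the same count, and pool with a mask so the padding is ignored. All function names below are hypothetical, the chunk length of 510 is an assumption (512 minus [CLS] and [SEP]), and the chunk vectors are placeholders for whatever the encoder returns per chunk. Each document is assumed to have at least one chunk.

```python
def split_into_chunks(tokens, chunk_len=510):
    """Split a token list into consecutive chunks of at most chunk_len."""
    return [tokens[i:i + chunk_len] for i in range(0, len(tokens), chunk_len)]


def pad_chunk_vectors(doc_vectors, dim):
    """Pad each document's list of chunk vectors to the same chunk count.

    Returns the padded vectors and a 0/1 mask marking the real chunks.
    """
    max_chunks = max(len(vecs) for vecs in doc_vectors)
    padded, mask = [], []
    for vecs in doc_vectors:
        n_pad = max_chunks - len(vecs)
        padded.append(vecs + [[0.0] * dim for _ in range(n_pad)])
        mask.append([1] * len(vecs) + [0] * n_pad)
    return padded, mask


def masked_mean_pool(padded, mask):
    """Average the chunk vectors of each document, skipping padded chunks."""
    pooled = []
    for vecs, m in zip(padded, mask):
        n_real = sum(m)  # assumed >= 1 for every document
        dim = len(vecs[0])
        pooled.append(
            [sum(v[d] * k for v, k in zip(vecs, m)) / n_real for d in range(dim)]
        )
    return pooled


# Two documents with 2 and 1 chunk vectors (placeholder values).
docs = [[[1.0, 2.0], [3.0, 4.0]], [[5.0, 6.0]]]
padded, mask = pad_chunk_vectors(docs, dim=2)
print(masked_mean_pool(padded, mask))
```

The same mask works for max pooling or attention over chunks: padded positions are excluded before the reduction, so each document yields one fixed-size vector regardless of its length.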