You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The interface of tomotopy.utils.Corpus and of tomotopy.LDAModel.docs were unified. Now you can access the document in corpus with the same manner.
getitem of tomotopy.utils.Corpus was improved. Not only indexing by int, but also by Iterable[int], slicing are supported. Also indexing by uid is supported.
New methods tomotopy.utils.Corpus.extract_ngrams and tomotopy.utils.Corpus.concat_ngrams were added. They extracts n-gram collocations using PMI and concatenates them into a single words.
A new method tomotopy.LDAModel.add_corpus was added, and tomotopy.LDAModel.infer can receive corpus as input.
A new module tomotopy.coherence was added. It provides the way to calculate coherence of the model.
A paramter window_size was added to tomotopy.label.FoRelevance.
An issue was fixed where NaN often occurs when training tomotopy.HDPModel.
Now Python3.9 is supported.
A dependency to py-cpuinfo was removed and the initializing of the module was improved.