Skip to content

0.10.0

Compare
Choose a tag to compare
@bab2min bab2min released this 19 Dec 15:54
· 195 commits to main since this release
  • The interface of tomotopy.utils.Corpus and of tomotopy.LDAModel.docs were unified. Now you can access the document in corpus with the same manner.
  • getitem of tomotopy.utils.Corpus was improved. Not only indexing by int, but also by Iterable[int], slicing are supported. Also indexing by uid is supported.
  • New methods tomotopy.utils.Corpus.extract_ngrams and tomotopy.utils.Corpus.concat_ngrams were added. They extracts n-gram collocations using PMI and concatenates them into a single words.
  • A new method tomotopy.LDAModel.add_corpus was added, and tomotopy.LDAModel.infer can receive corpus as input.
  • A new module tomotopy.coherence was added. It provides the way to calculate coherence of the model.
  • A paramter window_size was added to tomotopy.label.FoRelevance.
  • An issue was fixed where NaN often occurs when training tomotopy.HDPModel.
  • Now Python3.9 is supported.
  • A dependency to py-cpuinfo was removed and the initializing of the module was improved.