Skip to content

lulumengyi/Automatic-Summarization-System

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Application and implementation of People China Daily text automatic summarization system based on word2vec algorithm

  • Trained a 300-dimensional Word2Vec model using Gensim on a Chinese news corpus.
  • Adopted the SIF(Smooth Inverse Frequency)algorithm to apply PCA, reducing word vector dimensions and obtaining superior sentence embeddings.
  • Calculated the cosine similarity between sentences and the title/article as a weighted score for summarization.Refined these scores with KNN smoothing and selected the top-k sentences based on rank.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages