Skip to content

LukasKriesch/BE_patents

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This repository contains the code used for creating a dataset of bioeconomy-related inventions. The repository contains three files:

train_pat.py: Contains the code for fine-tuning an SBERT-based model using SetFit.

topic_modeling_be_pats.py: Provides the code for generating a topic model with the the BERTopic-Framework.

technical_validation.py: Includes the code for comparing different pretrained SBERT models and evaluating them against keyword-based approaches for identifying bioeconomy-related inventions. This also includes code for a stratified validation of the classification results by CPC section.

The data folder contains the training data used in train_pat.py. The validation folder in the data folder contains the data used for the stratified validation of the classification results by CPC section.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages