Skip to content

Latest commit

 

History

History
45 lines (32 loc) · 1.38 KB

README.md

File metadata and controls

45 lines (32 loc) · 1.38 KB

structure-vision

Streamlit application, visualising the structures extracted by Grobid that include PDF coordinates. It uses the Streamlit component streamlit-pdf-viewer we've been developing.

This application allows you to visualise the following components:

  • authors
  • affiliations
  • title
  • head of sections
  • paragraphs or sentences
  • callout references in text
  • figures
  • formulas
  • bibliographic references

In addition, the sidebar contains other options, mostly for testing the various

Demo: https://structure-vision.streamlit.app/

screenshot1.png

This tool was built as a test application for streamlit-pdf-viewer, a new streamlit component for visualising enhanced PDF documents.

Getting started

pip install -r requirements
streamlit run streamlit_app.py

The Grobid URL should be provideed via environment variables, or via a .env file, which should be created in the the root folder of the project. Here an example of how the project is configured to hit the local Grobid instance:

GROBID_URL=http://localhost:8070

Developer notes

To install the Streamlit PDF viewer via github:

pip install -e git+https://github.com/lfoppiano/streamlit-pdf-viewer