IMPORTANT: This appliction is for reference use only. It is NOT maintained and contains references to now outdated libraries that include security vulnerabilities.
DatavArk is an automated, domain-specific information acquisition and extraction platform.
This prototype was developed to gather data in the domain of Unexplained Anomalous Phenomena (UAP). The app ingests unstructured textual reports submitted to NUFORC.org and posted on Reddit.com. Extracted entities are recorded in a PostGIS SQL database.
Natural Language Processing (NLP) is implemented through a custom-trained, transformer-based machine learning model, deployed through the spaCy Python library. The web app is written using the Python Django framework.
The project was solely authored by Dan Bright, [email protected]