Detecting structure in verbose CSV files via classifying lines and cells.
- This project is implemented in Python 3.7.7.
- Use the following command to download all required libraries for Python:
pip install -r requirements.txt
- We recommend to install the required libraries in a separated virtual environment.
- Use the following script to run the Strudel program:
python run_strudel.py
The following arguments can be used for the above script:
-
-d: training dataset
-
-t: test dataset. If not given, the program does cross-validation on the training dataset
-
-f: dataset path
-
-o: output path
-
Results are stored in a csv file.
- 0.1
- Initial Release
This project is licensed under the Apache License 2.0 License - see the LICENSE.md file for details
- Co-authors
- Contributors
Please contact Lan Jiang if you have any questions or want to report bugs.
If you find this repository useful in your work, please cite our EDBT'21 paper:
@inproceedings{jiang2021structure,
title={Structure Detection in Verbose CSV Files.},
author={Jiang, Lan and Vitagliano, Gerardo and Naumann, Felix},
booktitle={EDBT},
pages={193--204},
year={2021}
}