Commit

Merge pull request #43 from adbar/main
update readme: add Trafilatura
diegosiqueir4 authored Oct 29, 2024
2 parents 605d784 + 341b129 commit 157e425
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
@@ -42,6 +42,7 @@ This is a curated list of tools, resources, and services supporting the Digital
- [OpenArchive](https://open-archive.org/) - Making it easy to store, share, and amplify your mobile media while protecting your identity.
- [Open EU Data Portal](https://data.europa.eu/euodp/en/data/) - European Union open data.
- [Social Feed Manager](https://gwu-libraries.github.io/sfm-ui/) - Open source software that harvests social media data and web resources from Twitter, Tumblr, Flickr, and Sina Weibo.
+ - [Trafilatura](https://trafilatura.readthedocs.io/) - Open source software to gather text and metadata on the Web: Crawling, scraping, extraction, output in multiple formats. Usable with Python, R and on the command-line.
- [Transkribus](https://transkribus.eu/) - Transcribe. Collaborate. Share and benefit from cutting edge research in Handwritten Text Recognition!
- [Textgrid](https://textgrid.de/) - Open source tools and services support humanistic scholars during the entire process of research, especially in digital scholarly editing.
- [webrecorder.io](https://webrecorder.io/) - Web archiving service anyone can use for free to save web pages.
