Skip to content

(jruby) extract text and tables from PDFs using Mozilla's tabula-extract or just straightup OCR.

License

Notifications You must be signed in to change notification settings

noahpryor/pdflib

Repository files navigation

	#install jruby
	bundle install
	#run with example schema
	bundle exec jruby page_extractor.rb

About

(jruby) extract text and tables from PDFs using Mozilla's tabula-extract or just straightup OCR.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages