This repository has been archived by the owner on Mar 21, 2020. It is now read-only.

giuseppetotaro/StringsParser

StringsParser

This is preliminary work on using the strings command (or an alternative to it) in Apache Tika.

Getting started

This repository includes the implementation of the StringsParser, a parser that uses the strings command in order to extract ASCII strings from binary files.
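To make "extracting ASCII strings" concrete, here is a minimal pure-Java sketch of the kind of output the strings command produces: runs of printable ASCII bytes of at least a minimum length (4 is the usual default). This is only an illustration. StringsParser itself delegates to the external command, and the class and method names below (AsciiStrings, extract) are hypothetical and not part of this repository.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of StringsParser.
class AsciiStrings {

    // Returns every run of at least minLength printable ASCII bytes
    // (0x20..0x7E, plus tab), in the order the runs appear in data.
    static List<String> extract(byte[] data, int minLength) {
        List<String> result = new ArrayList<>();
        StringBuilder run = new StringBuilder();
        for (byte b : data) {
            int c = b & 0xFF; // treat the byte as unsigned
            if ((c >= 0x20 && c <= 0x7E) || c == '\t') {
                run.append((char) c);
            } else {
                // A non-printable byte ends the current run.
                if (run.length() >= minLength) {
                    result.add(run.toString());
                }
                run.setLength(0);
            }
        }
        // Flush a run that reaches the end of the input.
        if (run.length() >= minLength) {
            result.add(run.toString());
        }
        return result;
    }

    public static void main(String[] args) {
        byte[] sample = {0x00, 'T', 'i', 'k', 'a', 0x01, 'o', 'k', 0x02,
                         'p', 'a', 'r', 's', 'e', 'r'};
        System.out.println(extract(sample, 4)); // prints [Tika, parser]
    }
}
```

The two-character run "ok" is dropped because it is shorter than the minimum length, which is how such a filter keeps most binary noise out of the extracted text.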

Use the build.sh and run.sh scripts (on a Unix-like OS) to compile and test StringsParser.

TIKA-1541

An issue has been created on the Tika Issue Tracker system: TIKA-1541. Please refer to that issue for comments and feedback.

Notes

The repository is organized as follows:

  • govdocs1/016

This folder contains some files from govdocs1 (#016 subset) that Tika is not able to detect. These files are marked as application/octet-stream.

  • src

This folder contains the .java source files. In addition to the main classes, StringsParser.java and StringsConfig.java, there is an extremely simple StringsTest.java for running the parser against a single file. StringsTest.java relies on a small utility class, Timer.java.

  • README.md

This README file.

  • build.sh

This script compiles the .java source code using the javac command.

  • run.sh

This script runs the StringsParser against the files in the govdocs1/016 folder.
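A batch run of this kind can be approximated without the parser. The sketch below invokes the external strings command on each regular file in a folder and reports how long each call took. It is not the code in this repository: the class and method names (StringsBatch, buildCommand, regularFiles, run) are hypothetical, the command is assumed to be on the PATH, and the sketch assumes Java 11 or later. The only option used is -n, the standard way to set the minimum string length.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

// Hypothetical illustration; not the StringsParser or StringsTest classes.
class StringsBatch {

    // Builds the command line: strings -n <minLength> <file>
    static List<String> buildCommand(Path file, int minLength) {
        List<String> cmd = new ArrayList<>();
        cmd.add("strings");
        cmd.add("-n");
        cmd.add(Integer.toString(minLength));
        cmd.add(file.toString());
        return cmd;
    }

    // Lists the regular files directly inside folder, sorted by path.
    static List<Path> regularFiles(Path folder) throws IOException {
        try (Stream<Path> entries = Files.list(folder)) {
            return entries.filter(Files::isRegularFile)
                          .sorted()
                          .collect(Collectors.toList());
        }
    }

    // Runs the command on one file and returns its standard output.
    static String run(Path file, int minLength)
            throws IOException, InterruptedException {
        Process p = new ProcessBuilder(buildCommand(file, minLength))
                .redirectErrorStream(true)
                .start();
        byte[] out = p.getInputStream().readAllBytes();
        int exit = p.waitFor();
        if (exit != 0) {
            throw new IOException("strings exited with status " + exit);
        }
        return new String(out, StandardCharsets.US_ASCII);
    }

    public static void main(String[] args) throws Exception {
        Path folder = Path.of(args.length > 0 ? args[0] : "govdocs1/016");
        for (Path file : regularFiles(folder)) {
            long start = System.nanoTime();
            String text = run(file, 4);
            long ms = (System.nanoTime() - start) / 1_000_000;
            System.out.println(file + ": " + text.length()
                    + " chars in " + ms + " ms");
        }
    }
}
```

Reading the whole output before calling waitFor avoids a deadlock when the child process fills its output pipe, which matters for large binary files.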

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.
