http://www.open-search-server.com
Copyright Emmanuel Keller / Jaeksoft (2008-2013) This software is licensed under the GPL v3.
OpenSearchServer is a powerful, enterprise-class, search engine program. Using the web user interface, the crawlers (web, file, database, ...) and the REST/RESTFul API you will be able to integrate quickly and easily advanced full-text search capabilities in your application. OpenSearchServer runs on Linux/Unix/BSD/Windows.
You need to have a JAVA 6 (or newer) runtime on your server
http://www.open-search-server.com/download
FILE
opensearchserver.jar -> the main libraryFILE
README.md -> this fileDIR
data -> will contains your indexDIR
server -> will contains servers filesFILE
start.sh -> Shell to start the server on UnixFILE
start.bat -> Batch to start the server on WindowsFILE
NOTICE.txt -> the third-party license informationsDIR
LICENSES -> Contains the detailled licenses
Optionally, can you change the parameters in the start.sh/start.bat script:
- The allowed memory size
- The TCP port (9090 by default)
cd opensearchserver
./start.sh
- Download binaries: http://www.open-search-server.com/download
- The documentation of the RESTFul API: https://github.com/jaeksoft/opensearchserver/wiki
- The user guide: http://www.open-search-server.com/documentation
- Issues (bugs, enhancements): https://github.com/jaeksoft/opensearchserver/issues
- Advanced full-text search features
- Phonetic search
- Advanced boolean search with query language
- Clustered results with faceting and collapsing
- Filter search using sub-requests (including negative filters)
- Geolocation
- Spell-checking
- Relevance customization
- Search suggestion facility (auto-completion)
- Supports 17 languages
- Fields schema with analyzers in each language
- Several filters: n-gram, lemmatization, shingle, stripping diacritic from words,…
- Automatic language recognition
- Named entity recognition
- Word synonyms and expression synonyms
- Export indexed terms with frequencies
- Automatic classification
- HTML / XHTML
- MS Office documents (Word, Excel, Powerpoint, Visio, Publisher)
- OpenOffice documents
- Adobe PDF (with OCR)
- RTF, Plaintext
- Audio files metadata (wav, mp3, AIFF, Ogg)
- Torrent files
- OCR over images
- The web crawler for internet, extranet and intranet
- The file systems crawler for local and remote files (NFS, SMB/CIFS, FTP, FTPS, SWIFT)
- The database crawler for all JDBC databases (MySQL, PostgreSQL, Oracle, SQL Server, …)
- Filter inclusion or exclusion with wildcards
- Session parameters removal
- SQL join and linked files support
- Screenshot capture
- Sitemap import
- REST API (XML and JSON)
- SOAP Web Service
- Monitoring module
- Index replication
- Scheduler for management of periodic tasks
- WordPress plugin and Drupal module