Open source search comparison
Apache Solr is a powerful open source search platform built on the Apache Lucene search library. It is one of the most widely deployed search engine platforms in the world. Solr is written in Java and provides both a RESTful XML interface and a JSON API with which search applications can be built. The Solr search engine has a reputation for being extremely stable, flexible, and reliable. It is under active development by a large community of open source committers, under the direction of the Apache Software Foundation. Solr provides a large set of core search functions. In experienced hands, it is an extremely powerful and flexible platform on which search-based and big data analytics applications can be built.
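To make the HTTP interface concrete, here is a minimal sketch of how a client might build a request URL for Solr's select handler and ask for a JSON response. The host, port, collection name ("articles"), and query string are assumptions chosen for illustration, and the helper function is hypothetical rather than part of any Solr client library.

```python
from urllib.parse import urlencode


def build_solr_select_url(base_url, collection, query, rows=10):
    """Build a URL for Solr's /select handler that requests JSON output."""
    # q is the query string, rows caps the number of hits, wt picks the format.
    params = {"q": query, "rows": rows, "wt": "json"}
    return f"{base_url}/solr/{collection}/select?{urlencode(params)}"


# Assumed local Solr instance and a hypothetical "articles" collection.
url = build_solr_select_url("http://localhost:8983", "articles", "title:search")
print(url)
```

Fetching that URL with any HTTP client should return the matching documents as JSON, provided a Solr instance with such a collection is actually running.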
Sphinx Search
Sphinx is an open source full-text search server. It is written in C++ and runs on Linux, Windows, macOS, Solaris, FreeBSD, and a few other systems.
Sphinx can batch index and search data stored in SQL databases and NoSQL storage, or index and search data on the fly, so working with Sphinx is much like working with a database server.
A variety of text processing features let you fine-tune Sphinx for specific application requirements, and a number of relevance functions help ensure search quality.
Searching via SphinxAPI is simple, taking as few as three lines of code, and querying via SphinxQL is simpler still, with search queries expressed in plain SQL.
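As a rough illustration of a search query expressed in SQL, the sketch below builds a SphinxQL full-text statement using the MATCH() clause. The index name ("articles_idx"), the search phrase, and the helper function are hypothetical, and the %s placeholder assumes a Python MySQL driver that uses that parameter style.

```python
def build_sphinxql_search(index, limit=10):
    """Return a SphinxQL full-text query with a %s placeholder for the phrase."""
    # The index name is formatted into the statement, so accept only names
    # that are valid Python identifiers to avoid injecting arbitrary SQL.
    if not index.isidentifier():
        raise ValueError(f"unexpected index name: {index!r}")
    return f"SELECT id FROM {index} WHERE MATCH(%s) LIMIT {int(limit)}"


# Hypothetical index name and search phrase.
sql = build_sphinxql_search("articles_idx", limit=5)
params = ("open source search",)
print(sql, params)
```

Because SphinxQL is served over the MySQL wire protocol, a MySQL client library could pass sql and params to its execute call once connected to the SphinxQL listener (often configured on port 9306, which is an assumption here, not something the text states).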
Performance and scalability
- Indexing performance:
Sphinx indexes 10 to 15 MB of text per second on a single CPU core, which works out to 60+ MB/sec per server on a dedicated indexing machine.
- Searching performance:
Searching a 1.2 GB text collection of 100,000 documents, typical of daily development and testing, runs at 500+ queries/sec on a 2-core desktop machine with 2 GB of RAM.
- The best-known Sphinx cluster indexes 25+ billion documents, amounting to 9 TB of data. A popular deployment is Craigslist, which serves 300+ million search queries/day.
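The figures above are easier to compare after a little arithmetic. The sketch below is a back-of-the-envelope calculation, assuming decimal units (1 GB = 1000 MB, 1 TB = 10^12 bytes) and a perfectly even query load across the day; the function names are made up for illustration, and real peak traffic would be higher than the daily average.

```python
SECONDS_PER_DAY = 24 * 60 * 60


def indexing_seconds(corpus_mb, rate_mb_per_sec):
    """Time to index a corpus at a constant throughput."""
    return corpus_mb / rate_mb_per_sec


def average_qps(queries_per_day):
    """Average queries per second, assuming an even load over 24 hours."""
    return queries_per_day / SECONDS_PER_DAY


corpus_mb = 1200  # the 1.2 GB test collection, taking 1 GB = 1000 MB

single_core_slow = indexing_seconds(corpus_mb, 10)  # at 10 MB/sec
single_core_fast = indexing_seconds(corpus_mb, 15)  # at 15 MB/sec
full_server = indexing_seconds(corpus_mb, 60)       # at 60 MB/sec

craigslist_qps = average_qps(300_000_000)

# Average size per document in the 25-billion-document, 9 TB cluster.
bytes_per_doc = (9 * 10**12) / (25 * 10**9)

print(f"single core: {single_core_fast:.0f} to {single_core_slow:.0f} s")
print(f"dedicated server: {full_server:.0f} s")
print(f"Craigslist average: {craigslist_qps:.1f} queries/sec")
print(f"average document size: {bytes_per_doc:.0f} bytes")
```

Under these assumptions the 1.2 GB collection indexes in roughly 80 to 120 seconds on one core, and 300 million queries/day averages out to a few thousand queries per second.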