It is suitable for any application that requires full-text search, especially cross-platform.
Here are some key features of "Apache Lucene":
Scalable, High-Performance Indexing:
· Over 20MB/minute on Pentium M 1.5GHz
· Small RAM requirements -- only 1MB heap
· Incremental indexing as fast as batch indexing
· Index size roughly 20-30% the size of text indexed
Powerful, Accurate and Efficient Search Algorithms:
· Ranked searching -- best results returned first
· Many powerful query types: phrase queries, wildcard queries, proximity queries, range queries and more
· Fielded searching (e.g., title, author, contents)
· Date-range searching
· Sorting by any field
· Multiple-index searching with merged results
· Allows simultaneous update and searching
What's New in This Release: [ read full changelog ]
· In addition to Java 5 and Java 6, this release has now full Java 7 support (minimum JDK 7u1 required).
· TypeTokenFilter filters tokens based on their TypeAttribute.
· Fixed offset bugs in a number of CharFilters, Tokenizers and TokenFilters that could lead to exceptions during highlighting.
· Added phonetic encoders: Metaphone, Soundex, Caverphone, Beider-Morse, etc.
· CJKBigramFilter and CJKWidthFilter replace CJKTokenizer.
· Kuromoji morphological analyzer tokenizes Japanese text, producing both compound words and their segmentation.
· Static index pruning (Carmel pruning) removes postings with low within-document term frequency.
· QueryParser now interprets '*' as an open end for range queries.
· FieldValueFilter excludes documents missing the specified field.
· CheckIndex and IndexUpgrader allow you to specify the specific FSDirectory implementation to use with the new -dir-impl command-line option.
· FSTs can now do reverse lookup (by output) in certain cases and can be packed to reduce their size. There is now a method to retrieve top N shortest paths from a start node in an FST.
· New WFSTCompletionLookup suggester supports finer-grained ranking for suggestions.
· FST based suggesters now use an offline (disk-based) sort, instead of in-memory sort, when pre-sorting the suggestions.
· ToChildBlockJoinQuery joins in the opposite direction (parent down to child documents).
· New query-time joining is more flexible (but less performant) than index-time joins.
· Added HTMLStripCharFilter to strip HTML markup.