Softpedia
 


SCRIPTS CATEGORIES:



NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • PeoplePods 0.9
  • Brackets Sprint 8
  • elFinder 2.0 RC1
  • BBClone 0.6.1
  • Twitter Follow Box...
  • Multilingual Press...
  • SimplePie 1.2.1
  • TinyTips 1.2
  • SWFUpload 2.2.0.1 ...
  • Head Cleaner 1.4.2.9
  • Home > Scripts > Search Engines

    Apache Solr 3.6.0

    Download button


    Downloads: 1,200  Tell us about an update
    User Rating:
    Rated by:
    Very Good (4.0/5)
    4 user(s)
    Developer:

    Website:

    License / Price:

    Platforms:

    Databases:

    Language:

    Last Updated:

    Category:
    Apache Software Foundation | More scripts
    lucene.apache.org
    Apache License 

    Windows / Linux / Mac OS / BSD / Solaris
    N/A
    Java
    April 17th, 2012, 10:30 GMT [view history]
    C: \ Search Engines

     Read user reviews (0)  Refer to a friend  Subscribe

    Apache Solr description

    This is an open source enterprise search server based on the Lucene Java search library

    It has XML/HTTP, Ruby, JSON, and Python APIs.

    Solr is highly scalable, providing distributed search and index replication, and it powers the search and navigation features of many of the world's largest internet sites.

    Solr is written in Java and runs as a standalone full-text search server within a servlet container such as Tomcat. Solr uses the Lucene Java search library at its core for full-text indexing and search, and has REST-like HTTP/XML and JSON APIs that make it easy to use from virtually any programming language.

    It's powerful external configuration allows it to be tailored to almost any type of application without Java coding, and it has an extensive plugin architecture when more advanced customization is required.

    Here are some key features of "Apache Solr":

    General Features:
    · Advanced Full-Text Search Capabilities
    · Optimized for High Volume Web Traffic
    · Standards Based Open Interfaces - XML,JSON and HTTP
    · Comprehensive HTML Administration Interfaces
    · Server statistics exposed over JMX for monitoring
    · Scalability - Efficient Replication to other Solr Search Servers
    · Flexible and Adaptable with XML configuration
    · Extensible Plugin Architecture
    · A Real Data Schema, with Numeric Types, Dynamic Fields, Unique Keys
    · Powerful Extensions to the Lucene Query Language
    · Faceted Search and Filtering
    · Advanced, Configurable Text Analysis
    · Highly Configurable and User Extensible Caching
    · Performance Optimizations
    · External Configuration via XML
    · An Administration Interface
    · Monitorable Logging
    · Fast Incremental Updates and Index Replication
    · Highly Scalable Distributed search with sharded index across multiple hosts
    · XML, CSV/delimited-text, and binary update formats
    · Easy ways to pull in data from databases and XML files from local disk and HTTP sources
    · Rich Document Parsing and Indexing (PDF, Word, HTML, etc) using Apache Tika
    · Multiple search indices

    Schema:
    · Defines the field types and fields of documents
    · Can drive more intelligent processing
    · Declarative Lucene Analyzer specification
    · Dynamic Fields enables on-the-fly addition of new fields
    · CopyField functionality allows indexing a single field multiple ways, or combining multiple fields into a single searchable field
    · Explicit types eliminates the need for guessing types of fields
    · External file-based configuration of stopword lists, synonym lists, and protected word lists
    · Many additional text analysis components including word splitting, regex and sounds-like filters

    Query:
    · HTTP interface with configurable response formats (XML/XSLT, JSON, Python, Ruby, PHP, Velocity, binary)
    · Sort by any number of fields
    · Advanced DisMax query parser for high relevancy results from user-entered queries
    · Highlighted context snippets
    · Faceted Searching based on unique field values, explicit queries, or date ranges
    · Multi-Select Faceting by tagging and selectively excluding filters
    · Spelling suggestions for user queries
    · More Like This suggestions for given document
    · Function Query - influence the score by user specified complex functions of numeric fields or query relevancy scores.
    · Range filter over Function Query results
    · Date Math - specify dates relative to "NOW" in queries and updates
    · Dynamic search results clustering using Carrot2
    · Numeric field statistics such as min, max, average, standard deviation
    · Combine queries derived from different syntaxes
    · Auto-suggest functionality
    · Allow configuration of top results for a query, overriding normal scoring and sorting
    · Performance Optimizations

    Core:
    · Dynamically create and delete document collections without restarting
    · Pluggable query handlers and extensible XML data format
    · Pluggable user functions for Function Query
    · Customizable component based request handler with distributed search support
    · Document uniqueness enforcement based on unique key field
    · Duplicate document detection, including fuzzy near duplicates
    · Custom index processing chains, allowing document manipulation before indexing
    · User configurable commands triggered on index changes
    · Ability to control where docs with the sort field missing will be placed
    · "Luke" request handler for corpus information

    Caching:
    · Configurable Query Result, Filter, and Document cache instances
    · Pluggable Cache implementations, including a lock free, high concurrency implementation
    · Cache warming in background
    · Autowarming in background
    · Fast/small filter implementation
    · User level caching with autowarming support

    Replication:
    · Efficient distribution of index parts that have changed
    · Pull strategy allows for easy addition of searchers
    · Configurable distribution interval allows tradeoff between timeliness and cache utilization
    · Replication and automatic reloading of configuration files

    Admin Interface:
    · Comprehensive statistics on cache utilization, updates, and queries
    · Interactive schema browser that includes index statistics
    · Replication monitoring
    · Full logging control
    · Text analysis debugger, showing result of every stage in an analyzer
    · Web Query Interface with debugging output

    What's New in This Release: [ read full changelog ]

    · New SolrJ client connector using Apache HTTP Components HTTP client.
    · Many analyzer factories are now "multi term query aware" allowing for things like field type aware lowercasing when building prefix & wildcard queries.
    · New Kuromoji morphological analyzer tokenizes Japanese text, producing both compound words and their segmentation.
    · Range Faceting (Dates & Numbers) is now supported in distributed search.
    · HTMLStripCharFilter has been completely re-implemented, fixing many bugs and greatly improving the performance.
    · StreamingUpdateSolrServer now supports the javabin format.
    · New LFU Cache option for use in Solr's internal caches.
    · Memory performance improvements to all FST based suggesters.
    · New WFSTLookupFactory suggester supports finer-grained ranking for suggestions.
    · New options for configuring the amount of concurrency used in distributed searches.
    · Many bug fixes.



    TAGS:

    search server | search engine | find string | search | server | engine



    HTML code for linking to this page:


    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM