Softpedia
 


SCRIPTS CATEGORIES:



NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • PeoplePods 0.9
  • Brackets Sprint 8
  • elFinder 2.0 RC1
  • BBClone 0.6.1
  • Twitter Follow Box...
  • Multilingual Press...
  • SimplePie 1.2.1
  • TinyTips 1.2
  • SWFUpload 2.2.0.1 ...
  • Head Cleaner 1.4.2.9
  • Home > Scripts > Search Engines

    Apache Nutch 1.4

    Download button


    Downloads: 1,011  Tell us about an update
    User Rating:
    Rated by:
    NOT RATED
    0 user(s)
    Developer:

    Website:

    License / Price:

    Platforms:

    Databases:

    Language:

    Last Updated:

    Category:
    Apache Software Foundation | More scripts
    nutch.apache.org
    Apache License 

    Windows / Linux / Mac OS / BSD / Solaris
    N/A
    Java
    December 1st, 2011, 10:14 GMT [view history]
    C: \ Search Engines

     Read user reviews (0)  Refer to a friend  Subscribe

    Apache Nutch description

    This is an open source Java web-search software

    It builds on Lucene Java, adding new web-specifics, such as parsers for HTML, a crawler, a link-graph database and other document formats.

    What's New in This Release: [ read full changelog ]

    · Added Solr 4x (trunk) example schema.
    · Added '/runtime' to svn ignore.
    · Application/xhtml+xml should be enabled in plugin.xml of parse-html; allow multiple mimetypes for plugin.xml.
    · Fixed parse-tika and parse-html to use relative URL resolution per RFC-3986.
    · Upgraded to Tika 0.10. NOTE: Tika's new RTF parser may ignore more text in malformed documents than previously - see TIKA-748 for details.
    · Added Sonar targets to Ant build.xml.
    · Upgraded SolrJ to version 3.4.0.
    · Ant pmd target is broken.
    · Upgraded Solr schema to version 1.4.



    TAGS:

    search engine | web crawler | HTML parser | search | engine | crawler



    HTML code for linking to this page:


    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM