Softpedia
 


SCRIPTS CATEGORIES:



NEWS ARCHIVE >>
SOFTPEDIA REVIEWS >>
MEET THE EDITORS >>
WEEK'S BEST
  • Koken 0.8.2
  • ContentBox 1.5.2
  • jQPlayer 0.5.2
  • SPOILER ALERT! 0.0.2
  • jQuery Mask Plugin 0.9.0
  • Easing Slider 2.1.2
  • Btapp.js 0.2.0
  • WiiFlash 0.4.5
  • Breeze.js 1.3.3
  • TinyMCE Templates 3.0.2
  • Home > Scripts > Development Tools > Other Libraries

    juniversalchardet 1.0.3

    Download button


    Downloads: 636  Tell us about an update
    User Rating:
    Rated by:
    NOT RATED
    0 user(s)
    Developer:

    Website:

    License / Price:

    Platforms:

    Databases:

    Language:

    Last Updated:

    Category:
    Kohei Taketa | More scripts
    code.google.com
    Other Free / Open Source License - MPL Mozilla Public License 

    Windows / Linux / Mac OS / BSD / Solaris
    N/A
    Java
    November 7th, 2009, 01:58 GMT
    C: \ Development Tools \ Other Libraries

     Read user reviews (0)  Refer to a friend  Subscribe

    juniversalchardet description

    This is a Java port of 'universalchardet', that is the encoding detector library used by Mozilla

    This is an implementation of the original Netscape library, which is currently used by Mozilla.

    Installation:
    1. Construct an instance of org.mozilla.universalchardet.UniversalDetector.
    2. Feed some data (typically several thousands bytes) to the detector by calling UniversalDetector.handleData().
    3. Notify the detector of the end of data by calling UniversalDetector.dataEnd().
    4. Get the detected encoding name by calling UniversalDetector.getDetectedCharset().
    5. Don't forget to call UniversalDetector.reset() before you reuse the detector instance.

    Here are some key features of "juniversalchardet":

    Chinese:
    · ISO-2022-CN
    · BIG5
    · EUC-TW
    · GB18030
    · HZ-GB-23121

    Cyrillic:
    · ISO-8859-5
    · KOI8-R
    · WINDOWS-1251
    · MACCYRILLIC
    · IBM866
    · IBM855

    Greek:
    · ISO-8859-7
    · WINDOWS-1253

    Hebrew:
    · ISO-8859-8
    · WINDOWS-1255

    Japanese:
    · ISO-2022-JP
    · SHIFT_JIS
    · EUC-JP

    Korean:
    · ISO-2022-KR
    · EUC-KR

    Unicode:
    · UTF-8
    · UTF-16BE / UTF-16LE
    · UTF-32BE / UTF-32LE / X-ISO-10646-UCS-4-34121 / X-ISO-10646-UCS-4-21431

    Others:
    · WINDOWS-1252



    TAGS:

    encoding detector | Java library | character detection | character | encoding | detector

    Go to top

    WindowsGamesDriversMacLinuxScriptsMobileHandheldNews

    SUBMIT PROGRAM   |   ADVERTISE   |   GET HELP   |   SEND US FEEDBACK   |   RSS FEEDS   |   UPDATE YOUR SOFTWARE   |   ROMANIAN FORUM