#title Charset Detecting
[[TableOfContents]]

= Overview =
 * A summary of libraries and APIs for character set detection, organized by programming language.
 * Charset detection is mostly built on two open libraries:
  * [https://www-archive.mozilla.org/projects/intl/chardet.html Mozilla Universal Charset Detect]
  * [http://icu-project.org/userguide/charsetDetection.html IBM International Components for Unicode Project]

= APIs by Language =
|| Language || API name || URL ||
|| C++ || Mozilla (Original) || https://dxr.mozilla.org/mozilla-central/source/intl/chardet ||
|| C/C++ || libchardet || https://github.com/Joungkyun/libchardet ||
|| C# || UDE || https://github.com/errepi/ude ||
|| Java || Jchardet || http://jchardet.sourceforge.net/ ||
|| Python || chardet || https://chardet.github.io/ or https://github.com/Joungkyun/python-chardet (c-binding) ||
|| Ruby || chardet || https://rubygems.org/search?query=chardet ||
|| Perl || Encode-Detect || https://metacpan.org/pod/release/JGMYERS/Encode-Detect-1.01/Detector.pm ||
|| PHP || mod_chardet || https://github.com/OOPS-ORG-PHP/mod_chardet ||

= API Characteristics by Language =

== Original Mozilla Code ==
 * C++ based
 * Not easy to separate from the browser code.
 * Recent versions include sample source under intl/chardet/test, so there may be some room to work with it.

== Nchardet ==
 * API for C#
 * Mozilla Universal Charset Detect Algorithm
 * As of 2016.05 the project appears to be gone.
 * A project called [https://github.com/errepi/ude UDE] exists on GitHub.

== Jchardet ==
 * chardet class API for Java
 * Mozilla Universal Charset Detect Algorithm
 * Hosted on SourceForge
 * Not examined in detail either.

== py-chardet ==
 * Python API
 * Mozilla Universal Charset Detect Algorithm
 * Written in pure Python
 * Drawback: it is slow.
 * Still the best known (it shows up most often in searches).
 * Development appears to have stopped, and several forks exist.
  * https://github.com/chardet/chardet
  * https://pypi.python.org/pypi/chardet
 * python-chardet C binding module
  * Written as a C binding because the pure Python implementation is slow.
  * https://github.com/Joungkyun/python-chardet
  * Requires libchardet
   * https://github.com/Joungkyun/libchardet

== chardet ruby ==
 * Several implementations are available as gems.
 * See https://rubygems.org/search?query=chardet

== Encode-Detect ==
 * Perl API
 * Mozilla Universal Charset Detect Algorithm
 * Fast, because it is a C++ binding.
 * Has bugs for some languages.
 * Despite the bugs, it is more accurate than python chardet.

== libchardet ==
 * C/C++ library (libchardet.so/libchardet.a support)
 * Mozilla Universal Charset Detect Algorithm
 * Modified from the C++ binding code of Encode-Detect 1.01
 * Provides a C wrapper API
 * Fixes bugs found in Encode-Detect
 * More accurate than python chardet and Encode-Detect
 * As of 2010/07 several libraries named libchardet exist. Most are based on the Encode-Detect C++ binding, and their frontend APIs differ slightly.
  * https://github.com/Joungkyun/libchardet
  * http://code.google.com/p/libibase/source/browse/#svn/trunk/devel/libchardet
  * http://github.com/cnangel/libchardet/
  * https://www.freedesktop.org/wiki/Software/uchardet/

== mod_chardet ==
 * PHP API
 * libchardet / libicu / Python C API binding
  * Mozilla Universal Charset Detect Algorithm
  * ICU Conversion Charset Detection Algorithm
  * Python chardet binding (with the Python C API)
 * Requires: libchardet as distributed from oops.org
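
As a rough illustration of how the Python chardet package listed above is typically called, the sketch below wraps `chardet.detect()` (which takes bytes and returns a dict containing `encoding` and `confidence`) and falls back to plain trial decoding when the package is not installed. The fallback is not the Mozilla algorithm; it only tries a fixed list of candidates in order. The helper names `trial_decode` and `detect_charset` and the candidate list are assumptions made for this example.

```python
from typing import Optional, Sequence

try:
    import chardet  # third-party package: pip install chardet
except ImportError:
    chardet = None  # the fallback below still works without it


def trial_decode(
    data: bytes,
    candidates: Sequence[str] = ("utf-8", "euc-kr", "cp949"),
) -> Optional[str]:
    """Return the first candidate encoding that decodes `data` without error.

    This is a simple fallback, not a statistical detector: the order of
    `candidates` decides the answer when several encodings would succeed.
    """
    for name in candidates:
        try:
            data.decode(name)
        except UnicodeDecodeError:
            continue
        return name
    return None


def detect_charset(data: bytes) -> Optional[str]:
    """Prefer chardet's guess when available, else use trial decoding."""
    if chardet is not None:
        guess = chardet.detect(data)  # dict with 'encoding' and 'confidence'
        if guess["encoding"]:
            return guess["encoding"]
    return trial_decode(data)


if __name__ == "__main__":
    sample = "한글 문자".encode("euc-kr")
    print(detect_charset(sample))
```

Short inputs give a statistical detector little evidence, so the reported encoding for a string this small may differ between chardet versions; checking the `confidence` value before trusting the guess is a reasonable precaution.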