Inxight Software has announced Categorizer 5.0, a new, hybrid categorization engine with automated techniques and editorial controls to ensure accuracy. Version 5.0 can also deploy third-party and industry-standard taxonomies and includes a "workbench" feature designed to facilitate the creation of testing of taxonomies.
Categorizer 5.0 is available as a component within the Inxight SmartDiscovery Extraction Server platform and works in concert with the company's ThingFinder extraction and Summarizer modules to structure unstructured text in more than 30 languages. Using the new workbench capabilities, categories can be defined manually (rules only), with learn-by-example only, or using both techniques together.
Further, Inxight says, the workbench provides a robust set of testing tools for taxonomy editors to define, maintain and test their taxonomies before deployment on a SmartDiscovery server. It also includes many other new features, including an interface to assign training documents, a rule-writing helper and a testing environment for the creation of truth sets and accuracy reporting.