GroupDocs.Classification for .NET is a versatile developer API used to build text and document classification/categorisation applications. The API supports four unique taxonomies types and has sophisticated document and text classification by using IAB-2 for designating standardised text categories, Documents taxonomy for a large number of document formats, or Sentiment (and Sentiment3) for the sentiment analysis.
The library analyses text, sentences, words, and supports classifying various industry-standard document formats including Microsoft Word, OpenDocument, RTF, PDF and TXT. Sentiment analysis (classification) supports Chinese, Spanish, English and German auto-detection or languages.
Using its native document processing/classification engine, GroupDocs.Classification for .NET does not need any components to work. It works on the .NET platform and supports Windows, Linux, macOS where .NET frameworks (including .NET Core) can be installed.
GroupDocs.Classification for .NET supports a number of popular document formats.
Microsoft Office
- Word: DOC, DOCX, DOCM, DOT, DOTX, DOTM, RTF
Other Formats
- Fixed Layout: PDF
- OpenDocument: ODT, OTT
- Text: TXT
Exact Document Classification
The GroupDocs.Classification API supports classification for a range of document formats.
Exact Text Classification
The GroupDocs.Classification API also supports text classification with four different taxonomies:
- · IAB-2
- · Documents
- · Sentiment
- · Sentiment3.
Exact Multilingual Sentiment Analysis
The GroupDocs.Classification for .NET allows developers to perform cross-domain Sentiment Analysis (Classification) in Spanish, English, Chinese and German. GroupDocs.Classification for .NET will detect the appropriate language(s) automatically.