InfoDigger™ is full-text search engine capable to instantly dig in hundreds of gigabytes or terabytes of unstructured information, databases and other external sources. It supports numerous of file formats, compressed archives, SQL servers, data sources, as follows:
- MS Office Documents - Supports MS Office formats;
- Open Document formats;
- Text Files - Plain text, RTF, CSV, UTF-8 based text source (files, database fields or blobs, streamed paragraphs, etc.);
- File Archives - automatically unpacks and indexes the content of ZIP, CAB, CHM, CPIO, CramFS, DEB, DMG, FAT, HFS, ISO, LZH, LZMA, MBR, MSI, NSIS, NTFS, RAR, RPM, SquashFS, UDF, VHD, WIM, XAR, Z, and ARJ;
- PDF files;
- XML, HTML, and other derivative formats;
- Databases access through custom interface.
The technology of InfoDigger™ posses the ability to be extended with new data type plug-ins in order to access directly external sources like remote Internet or Intranet information servers (Web, FTP, POP, IMAP, others). Combined with AGE technology this engine can provide distributed search capabilities, access to your existing data sources, and much more.
FEATURED SEARCH
You can use one or more of the InfoDigger's features, as follows:
- Intuitive search language, which follows the general syntax rules accepted by most of the world spoken languages;
- Unicode compliant - accepts search requests in different languages encoded with UTF-8;
- Automatic language transcoding - the Engine do its best to transcode and normalize search phrases, so you will get the most relevant results from your documents;
- Complete or partial wild-card comparison - you can use the standard ASCII symbols ? and * to replace one or more letters in your search requests;
- Supports regular expressions;
- Provides dynamical document ranking, and groups the results by type;
- Automatic merge of results even when the relevant documents are coming from different sources or locations;
- Fuzzy search capabilities - allows to find relevant documents using synonym words, auto spell corrections, and external thesaurus dictionaries.
With InfoDigger™ you can search transparently in structured and unstructured data sources using predefined or custom fields. Therefore you can access simultaneously directories, databases, parts of XML files or Web pages.
EMBEDDING AND DEVELOPMENT
You can integrate InfoDigger™ search engine into your applications using the AGE standard RESTful API. Therefore, you can use it practically in any modern language or on development platform, including but not limited to .NET, Java, Erlang, and much more...