|
|
Invariant™ Overview:
Quick Summary
- Completely flexible system can be customized on the fly
- Import and export any dataset while providing more investigative data points
- Massive processing power reduces turn around times
- Powerful defensible culling goes far beyond industry standards
- Dynamic
searching and filtering
with no reloads required
- Recognizes
over 2,500 file types from multiple operating
systems
- Built
from the ground up for Unicode and multi-language support
- Discovers all deeply embedded documents and Images
- Extremely
accurate rendering independent of print driver
limitations
- Most robust exporting available including interactive full color PDF

Technical Overview
System Features
- Fully
Distributed Processing
Workloads for projects
are shared across multiple multi-threaded worker
machines. Discovery, data extract, TIFF'ing and OCR
are all distributed and load balanced.
- Full
Unicode Support
Designed for Unicode
from the ground up. Original client data
which is not in Unicode is up-converted to Unicode. Data
delivery to 3rd party applications can be accomplished
in Unicode, ANSI, RTF or more than two dozen other
text encodings depending upon the application requirements
of the end user.
- Internationalized
Full-Text Search
Full text searches
on extracted text can currently be performed in
122 languages.
Discovery
- File
Identification
Invariant™ can identify
over 2,500 types of files from multiple operating
systems and examines file content rather
than file extension to determine file type.
- Investigative Metadata
Invariant never assumes a predefined set
of metadata fields. Instead, our software first
walks files in order to preserve volatile metadata
while examining and capturing any/all metadata
found within native files. Once new metadata fields
are discovered they are given a unique ID for cataloging
and added to our progressive metadata library (currently
more than 45,000 metadata fields).
- File
Formats
Invariant currently supports over 2,500 file types including all major email and data files. Below are just a few examples:
• All standard files and file types (email, word processing, spreadsheets...)
• Adobe Acrobat including packages and portfolioshandled correctly
• Bloomberg emails
• Microsoft Snapshot files
• CAB file archives
• RAR compressed files
• Full Office 2007 support including customized extraction and rendering
• XPS fixed and flowed documents
• Can extract annotations, comments and attachments from PDF files
• Rapidly address custom or rare file types
- Deep
Embedding
Invariant uncovers
deeply embedded objects and sub-documents
(for example, an email with attachments dragged
and dropped into a Word document; an embedded
Word document inside an Excel spreadsheet)
through a process of infinite recursion.
- Dates & Times
Dates and times
are stored in UTC. Time zone adjustments can be
made during the data export phase rather than having
to preset the time zone before discovery begins.
- TIFF'ing
and OCR
The OCR engine is only
utilized on pages where an image is present.
If a page of a document contains a mixture of
text and graphics, the text is extracted separately
and then the graphics are OCR'd. The resulting
text for that page will physically separate what
was OCR'd versus what was text extracted.
OCR engine supports 122 languages, including Chinese Traditional, Chinese Simplified,
Korean and Japanese.
Exporting & Load Files
- Highly
Customizable
Invariant’s powerful
export features allow the combination of export
tasks or split export tasks into separate work
units. Load files can be built rapidly without
having to copy the native files or images, allowing
for quick work verification. This technology
also allows us the ability to resume an export
without overwriting the existing destination
files.
- Rich-Text
Supported
Invariant™ supports
rich-text so complete hit-highlighting from search
requests can be included in the exported text.
- Image
Endorsements on TIFF/PDF
Export documents as TIFF
images or PDFs with full color and interactive
links. Endorsements on PDF exports are fully-searchable
and can be completely customized.
- Internationalized
Output
Export text in RTF, UTF-8,
UTF-16, Unicode, ANSI or more than two dozen
other text encodings.
- Wide
Range of Built-in Export Definitions
Extensible Export Mechanism
means we can provide a wide range of built-in
export definitions with Invariant. We
deliver a customized export solution to fit
any third-party application requirements including,
but not limited to: Concordance®, CT Summation
and Ringtail®.
|
|
|