Servient is built on a true “Big Data” platform. The building blocks of Servient include Hadoop, HBase, SOLR and other technologies that power the largest web companies in the world. Servient breaks down large volumes of unstructured data into small computation tasks that can be executed in parallel. Servient then distributes the computation across multiple resources allowing the application to perform at scale.
Servient permits companies to quickly ingest data into the Servient archive and automatically organize the data using Servient's advanced machine-learning models. Servient allows the enterprise to search, explore and analyze their unstructured data content at scale.
Hadoop indexes and analyzes terabytes of data in MapReduce and provides our clients with processing throughput flexibility with greater cost control. Hadoop enables Servient to easily adapt to data “bursts” in processing loads. Intensive increases in client data needs can be easily managed by adding more hardware to the Hadoop infrastructure to increase the throughput to process the client's data and then scale down the Hadoop cluster once the data is processed.
HBase is a NoSQL datastore that acts as an infinitely scalable document store. Servient utilizes HBase, which runs on top of Hadoop, to provide random realtime read/write access to our client’s Big-Data projects. Servient utilizes HBase to provide a more scalable solution with superior performance when handling complex data structures.
Solr is a scalable enterprise search engine that Servient uses to drive its powerful search function. Servient’s machine learning technology combined with Solr’s major features that includes powerful full-text search, hit highlighting, faceted search, near real-time indexing, dynamic clustering, database integration and rich document (e.g., Word, PDF) handling greatly improves the rate of recall.
Spark enables Servient’s machine learning to run at enterprise scale. Servient provides a variety of proprietary supervised and unsupervised machine learning algorithms that work in unison with those available on Spark. This blend of Servient’s proprietary algorithms working alongside Spark’s algorithms enables sophisticated workflows coupled with search to identify and categorize the relevant information in a highly superior manner than our competitors.
Servient is a Hadoop based Knowledge Discovery Archive (KDA) that combines web scale processing with archiving technology. It connects, archives and processes a variety of unstructured and structured enterprise data sources. The Hadoop based NoSQL data model allows support for multiple extensible evolving taxonomies. The solution enables several sophisticated workflows that combine search with unsupervised and supervised machine learning algorithms running on the Hadoop platform allowing the user to visualize and interact with the data in order to search and categorize. The end user has seamless integration to utilize Servient’s compliance and eDiscovery applications to interact with the data in Servient KDA.
In response to requests from federal regulators, a global corporation was challenged with finding responsive documents within 30 terabytes of data. This case study outlines how Servient's eDiscovery Platform's web scale capability was utilized to help the client create an efficient process that saved $2 million.