30 Digits, most well known for their world class Web Extractor which continues to structure data from the Internet and deliver it perfectly for repurposing, is also breaking ground in the internal data access market with innovations like the newest release of the Alfresco Extractor. Albeit lesser known, the internal data extraction, transformation, and loading (ETL) processes performed by the 30 Digits Extractors are powering systems in the world’s largest financial institutions, governments, and corporations. The application of them varies from cost saving actions consolidating data centers to making data more accessible through powering Enterprise Search applications to scanning and monitoring data for legal analysis.
We all know that the data in the Internet is growing exponentially, but what many miss is that the data internally is growing at an even faster rate (see the Cisco Visual Networking Index: Forecast and Methodology, 2010–2015 to get an idea of this data explosion). Couple this with the M&A activities of companies and the multitude of data systems out there, and the ability to get at data, wherever it is at, and repurpose or consolidate that in other systems becomes essential. This is what 30 Digits specializes in and in which it excels.
It is also worth mentioning that with the considerable consolidation going on in the search world (HP acquiring Autonomy and Oracle purchasing Endeca), it is going to be even more important to have an independent third party handling information retrieval and data transition without an agenda to push a particular content platform.
Add to all of these trends the tectonic changes happening in the software world caused by Open Source, it is no surprise that Alfresco has become a popular and widely used document management system (DMS). Knowing this and having many customers with Alfresco, 30 Digits developed the Alfresco Extractor in February this year. It has been well adopted and appreciated for its ease of use and quality.
As the customers requesting it became larger and their data stores reached into the hundreds of terabytes, the SOAP method of accessing data no longer achieved the necessary results. Hence 30 Digits took on the challenge of making an Extractor that could handle this scale with speed, accuracy, and reliability. The outcome of that is the Alfresco Extractor 2.0 which works over the REST API. Capturing versioning of documents, auto-tagging, normalization and transformation of fields, and many more features which were loved in the original are still there. On top of those features have come some great new ones like the visual tree for selecting branches of documents from which to gather data out of Alfresco.
To know more about the Alfresco Extractor or other 30 Digits products, email us at firstname.lastname@example.org.