June 5, 2014

Features:Diversified Data

esProc is specialized incomputing from various different data sources, and the computing result can be written back to multiple or single data source. esProc not only supports the rational database, MongoDB, Cassandra, and other NoSQL database, but also provides a great many functions to handle the structured data and the semi-structured data computing.

esProc can also directly retrieve the files on native machine and LAN, or seamlessly access the distributed file system like HDFS. It supports both the common txt file or Excel sheets, and the particular files inself-defined format with great performance.

Rational database: esProc supports all JDBC-enabled databases such as Oracle, MSSQL, MYSQL, and DB2 etc.

Text data source: esProc supports the structured text files and the customized separators for rows and columns,such as *.txt,*.log files.

esProc also supports Excel files of various versions directly and the binary files in proprietary format.

Seamlessly access HDFS: esProc has the in-built function to access HDFS, with the compatible access route.It supports the multi-node computing to read and retrieve the big data file in HDFS with cursor.

NoSQL database: esProc can access NoSQL databases with JDBC, including MongoDB, Cassandra, etc.

Semi-structured data: esProc provides the third party programming interface for users to develop the customized esProc function to resolve the XML, SOAP, or other types of semi-structured data. The customized functions can all be applied in the structured data computing similar to native esProc functions.

Besides retrieval, the computing result of esProc can also be written back to the original data source or data sources of various types, or written into multiple data sources at the same time. Similarly, esProc has the inbuilt functions to write back various data sources, including modifying a single record, and writing back the massive data.

As esProc provides the consistent JDBC interface for upper level applications, esProc and data sources can co-build the easy-to-use hybrid database. In the past, multi-data-source computing requires the high-end reporting tools, hard-to-maintain ETL, and expensive data warehouse. 

esProc isn’t binded with any specific data source, but it supports the combined computing of various data sources by nature. esProc can reduce the difficulty of corelating the big data and the traditional database, remove the restriction on single-source report, and enable Java application to confront the increasingly complex data environment.