About EnOS Offline Analytics¶
EnOS Offline Analytics Service is mainly targeted for the batch processing of data at rest to obtain insights from the data.
Batch processing is used in a variety of scenarios, from simple data transformations to a more complete ETL (extract-transform-load) pipeline. In a big data context, batch processing may operate over very large data sets, where the computation takes significant time. Batch processing typically leads to further interactive exploration, provides the modeling-ready data for machine learning, or writes the data to a data store that is optimized for analytics and visualization. One example of batch processing is to transform a large set of flat, semi-structured CSV or JSON files into a schematized and structured format that is ready for further querying. Typically the data is converted from the raw formats used for ingestion (such as CSV) into binary formats that are more performant for querying because they store data in a columnar format, and often provide indexes and inline statistics about the data.
EnOS Offline Analytics Service provides end-to-end toolset that enables you to load your data from various of data connection into a unified analytical datastore in EnOS, process the data through orchestrating workflows, and quickly and interactively analyze the data. The scalable cloud cluster of EnOS allows you to process huge amount of data on demand.
EnOS Offline Analytics Service primarily serves the following roles:
Data Developer & Analyst
Whose responsibilities are to:
- Develop data structure and algorithm
- Perform data analysis and data mining
- Design BI reports based on business KPIs to assist business decisions
The service provides an data integrator that helps you synchronize data between extensive heterogeneous data connection, such as to synchronize structured data from source databases to the Hive database in EnOS Cloud and from EnOS Cloud to external target databases. Learn more >>.
EnOS Offline Analytics Service provides a GUI-based Data IDE that enables batch data processing based on scheduled workflows. The batch processing operations supported various from simple data transformations to complete ETL (extract-transform-load). The IDE also provides common libraries out-of-the-box for most frequently used data processing operations. Learn more >>
Based on the open-source Apache Zeppelin project, EnOS Offline Analytics Service provide a light-weighted, notebook based data explorer that lowers the barrier for data processing, and enables data-driven, collaborative, interactive data analytics. Learn more >>