The Datameer X trial version provided with MapR's distribution includes several ready-to-use demos for exploring the functionality of Datameer X and illustrating big data analytics use cases in the areas of customer behavior analytics and IT systems management.
...
This demo analyzes email content, looking at time distribution and finding patterns of word use, response to specific topics. This demo consists of the workbook, EmailAnalytics
, the dashboard Email_Analytics_Dashboard
, and utilizes the data set email_archives
.
IT Systems Management
Log file analytics
This demo examines web traffic via raw Apache log files to summarize various types of errors experienced by users, identifying hotspots by time of day, page, error and potential security threats. This demo consists of the workbook WebserverErrorAnalysis
, the dashboard Web_Errors_Overview
, and utilizes the data set weblogs.
MapR cluster usage
This demo aggregates and summarizes resource utilization and availability information from a MapR cluster, including CPU, memory, Hadoop jobs, service status, and errors. Dashboards visualize trends in these metrics over time. The demo is based on a approximately one month of historical data captured from a large MapR cluster via a REST API. This doesn't represent data from your MapR cluster, although ingestion of this data into Datameer X is possible. See Datameer's documentation for more information.
This demo also highlights the ETL capabilities of Datameer X cleansing and transforming raw, hierarchical semi-structured data in a JSON format into metrics analysts can easily work with. The data was collected from MapR's REST-based monitoring API. The MapR_Cluster_ETL
workbook triggers the MapR_Cluster_Analytics
workbook automatically upon completion (something you can modify by clicking Configure after selecting the Mapr_Cluster_Analytics
workbook).
The demo visualizes the results of the analytics pipeline in the dashboards MapR_Cluster_Utilization
, and MapR_Cluster_Status
, and utilizes two data sets: MapR_cluster_stats
, which contains system-level information and MapR_node_stats
which contains node-level information.