Datameer
Datameer
Datameer
Reach the next level in data discovery. With Datameer, you can access and use
any type of data, anywhere in the world.
Freely upload and parse any file formats for analysis, including:
• HTML
• JSON
• XML
• CSV
• Text documents
• Apache logs
• Extract from 70+ data sources and a variety of pre-built data formats
• REST API allows any new file formats that suit your needs
• Examples include custom data parsing, tokenizing for text parsing and time
zone support
• Extract matching patterns with advanced parsing through regular
expressions
PA G E 2
Datameer D ATA S H E E T
Flipside™ also allows you to uncover hidden patterns. And with better profiling,
you’ll have a better sense of the data’s accuracy.
More accuracy and more recently refreshed data provides better insights.
PA G E 3
D ATA M E E R F O R D ATA P R E PA R AT I O N : E X P LO R E , P R O F I L E ,
B L E N D, C L E A N S E , E N R I C H , S H A R E , O P E R AT I O N A L I Z E Datameer
Who said data cleansing had to be tedious? With Datameer, you can quickly
identify outliers, duplicates, inconsistent values and more.
Identify outliers
Handle duplicates
Remove inconsistencies
Use the formula builder for any advanced patterns in the dataset. It identifies
edge cases or unique patterns in the data and whether they need to be
corrected.
PA G E 4
Datameer D ATA S H E E T
• Time saving
• Column splitting
• Column and row pivoting
• Statistical grouping
• Advanced text parsing
• Working with lists (concatenate fields to lists and expand to rows)
• Path construction
• Powerful if and comparison functions
• Date, time and text manipulation functions
• Enrich datasets inline with support for expressions
Analytic enrichment
Create valuable insights from data to complete the journey of data prep and
data discovery. Datameer provides a rich array of capabilities to enrich datasets
with analytics including:
• Path analysis
• Graph analytics
• Statistical functions
• Clustering
• Correlations
• Decision trees
• Text mining
• Sentiment analysis
PA G E 5
D ATA M E E R F O R D ATA P R E PA R AT I O N : E X P LO R E , P R O F I L E ,
B L E N D, C L E A N S E , E N R I C H , S H A R E , O P E R AT I O N A L I Z E Datameer
Find hidden patterns and trends in the data. Organize data during the data
preparation phase in different ways than with typical dimensions and metrics
used by standard analytics. The Datameer function API allows users to write
functions for domain specific transformations.
• Sessionization
• Custom binning
• Time windowing
• Advanced statistical grouping
• Function API to create domain-rich datasets
• Fraud
• Preventative maintenance
• Buying patterns
• Clickstream analysis
• Time-series analysis
• And more
Organize and search for different data artifacts used across the enterprise
Organize files
PA G E 6
Datameer D ATA S H E E T
PA G E 7
D ATA M E E R F O R D ATA P R E PA R AT I O N : E X P LO R E , P R O F I L E ,
B L E N D, C L E A N S E , E N R I C H , S H A R E , O P E R AT I O N A L I Z E Datameer
Datameer maintains full lineage, auditing the definition of and any changes
Datameer Enterprise to artifacts and calculations, and supporting even the most rigid compliance
• User Management processes.
• LDAP / AD Integration
• SAML / Single Sign-On
Datameer also includes a deep suite of encryption and obfuscation features,
• Permissions and sharing
• Role-based security (with custom allowing you to keep data such as Personally Identifiable Information (PII) safe
roles) and secure using SHA1 plugins for obfuscation. Migration tools are bundled
• Kerberos Integration
from development to production.
• Sheet dependency graph
• Audit & data volume logs
• Data retention policies Fine-grained, role-based security
• Secure impersonation to HDFS,
YARN, Hive (Sentry), HBase Datameer ensures full control over the data and models. Datameer also keeps a
• Basic Sentry integration
full catalog of datasets, models and derived content, and enables sharing with
• Encrypted metadata (with key
rotation) virtual team members across the organization.
PA G E 8