Contents
Title and Copyright Information
Preface
About this guide
Audience
Conventions
Contacting Oracle Customer Support
1 Introduction
BDD integration with Spark and Hadoop
Secure Hadoop options
Kerberos authentication
TLS/SSL and Encryption options
Preparing your data for ingest
2 Data Processing Workflows
Overview of workflows
Workflow for loading new data
Working with Hive tables
Sampling and attribute handling
Data type discovery
Studio creation of Hive tables
3 Data Processing Configuration
Date format configuration
Spark configuration
Adding a SerDe JAR to DP workflows
4 DP Command Line Interface Utility
DP CLI overview
DP CLI permissions and logging
DP CLI configuration
DP CLI flags
Using whitelists and blacklists
DP CLI cron job
Modifying the DP CLI cron job
DP CLI workflow examples
Processing Hive tables with Snappy compression
Changing Hive table properties
5 Updating Data Sets
About data set updates
Obtaining the Data Set Logical Name
Refresh updates
Refresh flag syntax
Running a Refresh update
Incremental updates
Incremental flag syntax
Running an Incremental update
Creating cron jobs for updates
6 Data Processing Logging
DP logging overview
DP logging properties file
DP log entry format
DP log levels
Example of DP logs during a workflow
Accessing YARN logs
Transform Service log
7 Data Enrichment Modules
About the Data Enrichment modules
Entity extractor
Noun Group extractor
TF.IDF Term extractor
Sentiment Analysis (document level)
Sentiment Analysis (sub-document level)
Address GeoTagger
IP Address GeoTagger
Reverse GeoTagger
Tag Stripper
Phonetic Hash
Language Detection
8 Dgraph Data Model
About the data model
Data records
Attributes
Assignments on attributes
Attribute data types
Supported languages
9 Dgraph HDFS Agent
About the Dgraph HDFS Agent
Importing records from HDFS for ingest
Exporting data from Studio
Dgraph HDFS Agent logging
Log entry format
Logging properties file
Index