Articles_1.md | Articles_2.md
- Ranger Tagsync KafkaException and run out of the client ports
Issue Resolution
Atlas
Kafka
Ranger
issue-resolution
kerberos
- Ambari 2.2.2.18 is now available
- Dynamically Add Hosts to a Cluster with Blueprints
Ambari
auto-discovery
blueprints
extensibility
provision
- Using Rhive with Kerberized Hadoop Cluster
Hive
faq
hiveserver2
kerberos
r
rhive
- Run Kafka+Storm in a Kerberized HDP cluster
Issue Resolution
Ranger
Storm
faq
issue-resolution
kerberos
- Phoenix Issues When Connecting HBase to Storm
Hbase
Phoenix
Storm
faq
- How to expand existing NiFi cluster fault tolerance using multiple data centers when using HDF 1.x/NiFi 0.x versions. (Part 1 of 2)
How-To/Tutorial
Nifi
fail-over
hdf
high-availability
how-to-tutorial
- How to expand existing NiFi cluster fault tolerance using multiple data centers when using HDF 1.x/NiFi 0.x verisons. (Part 2 of 2)
How-To/Tutorial
Nifi
fail-over
hdf
high-availability
how-to-tutorial
namenode-ha
- Hive and XML Parsing
Hive
xml
xpath
- Replacing Disks on Datanode Hosts
ambari2.4.1.0
datanode
disk
faq
hdp2.5.0
operations
- Deploying the Phoenix Query Server in production environments
How-To/Tutorial
Phoenix
how-to-tutorial
pqs
queryserver
- How to run NiFi as Non-Root User?
Nifi
faq
operations
- How to Stop service using Ambari DB if Ambari API is not working
How-To/Tutorial
Ambari
services
- How to move Docker for Mac vm image from internal to external hard drive.
How-To/Tutorial
Sandbox
docker
how-to-tutorial
- Getting past “Access denied for user” error during Hive Metastore failure
Issue Resolution
Hive
hiveserver2
metastore
mysql
- MP3 Jukebox with NIFI 1.x
How-To/Tutorial
Nifi
- NIFI 1.x For Automatic Music Playing Pipelines
How-To/Tutorial
Nifi
- NiFi: Easy custom logging of diverse sources in mere seconds
How-To/Tutorial
Nifi
groovy
logs
nifi-processor
reuse
- Hadoop enhancements in Isilon OneFS 801
faq
isilon
onefs
- Customizing Ranger Policies with Dynamic Context
How-To/Tutorial
Hive
Ranger
ranger-hive-plugin
- Update Zeppelin JDCB Interpreter To Support Solr SQL Queries
How-To/Tutorial
SOLR
jdbc
solrcloud
sql
zeppelin
- Exploring Apache Flink with HDP
ambari-service
compile
faq
flink
hdp
- NiFi Identity Conversion
How-To/Tutorial
Nifi
active-directory
kerberos
ldap
security
- Running Apache Pig Scripts from Apache NiFi and Storing the Results in HDFS
How-To/Tutorial
Pig
apache-nifi
- Hive on Tez Performance Tuning - Determining Reducer Counts
How-To/Tutorial
Hive
Tez
how-to-tutorial
memory
performance
- Content Data Store
How-To/Tutorial
Hbase
image-extract
- Incrementally Streaming RDBMS Data to Your Hadoop DataLake
How-To/Tutorial
Nifi
sql
- Run Hadoop Happily in O.S. Firewall Controlled Environment
How-To/Tutorial
configs
firewall
hdp-2.3.4
- Running SparkR in RStudio using HDP 2.4
How-To/Tutorial
Spark
rstudio
sparkr
sparksql
- Securing NiFi Step-by-Step
Nifi
security
- Creating a 3 node NiFi cluster using Vagrant and VirtualBox
How-To/Tutorial
Nifi
clustering
how-to-tutorial
vagrant
virtualbox
- Converting a Large JSON File into CSV
How-To/Tutorial
Nifi
csv
json
- Workaround for RegionServer startup failure after Ranger HBase plugin is enabled on a Kerberos-secured cluster - Ambari 2.2
How-To/Tutorial
Ambari
Hbase
Ranger
kerberos
- Resolving Connectivity Issue, rhive
Hive
faq
hive-jdbc
hiveserver2
r
- How to edit the “cluster-env.xml” entries using Ambari Rest APIs
How-To/Tutorial
Ambari
ambari-server
api
curl
- Atlas REST API Search Techniques
How-To/Tutorial
Atlas
atlas-api
- JSON-to-JSON Simplified with Apache NiFi and Jolt
How-To/Tutorial
apache-nifi
jolt
json
- JDBC error with Hive ACID Enabled
How-To/Tutorial
Hive
acid
hiveserver2
jdbc
- Optimizing Performance of Apache NiFi’s Network Listening Processors
How-To/Tutorial
Nifi
hdf
logs
performance
- Knox queries fail quickly with a 500 error
Issue Resolution
Knox
knox-0.5.0
knox-gateway
query
- Monitoring Your Containers with SysDig from HDF 2.0
How-To/Tutorial
Nifi
container
monitoring
- Hortonworks Cloud for AWS Technical Preview Update
How-To/Tutorial
hortonworks-cloud
- Migrate Oozie DB from Derby to Mysql
How-To/Tutorial
Oozie
derby
migration
- Hortonworks Data Cloud for AWS Technical Preview #3 is now available!
How-To/Tutorial
hortonworks-cloud
- Unable to access Ranger protected directories after NFS implementation
Issue Resolution
HDFS
Ranger
nfs
nfsgateway
- Why Kafka operations do not honor Ranger policies for users.
Issue Resolution
Kafka
zookeeper
- Zookeeper Sizing and Placement
faq
sizing
zookeeper
- How to enable deny-conditions and excludes in Ranger policies
How-To/Tutorial
Ranger
api
policies
- UMask vs HDFS default ACLs
HDFS
faq
hdfs-permissions
- Oozie shell action - Run Hive(TEZ) query in shell script via Oozie with Kerberos environment
How-To/Tutorial
oozie-shell
oozie-shell-action
- YARN service check fails with “Configuration parameter ‘yarn.resourcemanager.webapp.address.’ was not found in configurations dictionary!”
Issue Resolution
Ambari
YARN
service
- Import RDBMS into Hive table stored as ORC with SQOOP
Hive
Sqoop
how-to-tutorial
import
orc
sqoop import
- Modify Atlas Entity properties using REST API commands
How-To/Tutorial
Atlas
Hive
atlas-api
faq
- Enabling JMX monitoring for HiveServer2
How-To/Tutorial
Hive
hiveserver2
jmx
- Install HDB (HAWQ) via Ambari and use Zeppelin for visualization
How-To/Tutorial
Ambari
hawq
hdb
zeppelin
- How to change /apps/hive/warehouse directory permission which shows 777 by default.
Issue Resolution
Hive
authorization
faq
- Changing dfs.nameservices value after HDFS HA has been enabled
HDFS
configuration
high-availability
how-to-tutorial
namenode ha
- Aggregation in PIG and storage in HIVE
How-To/Tutorial
Hive
Pig
aggregate
groups
hcat
hcatalog
metadata
metastore
- Importing Apache incubator-ranger Project into Eclipse IDE
How-To/Tutorial
Ranger
eclipse
integration
ranger-admin
- Enable JMX metrics on hadoop using jmxterm
How-To/Tutorial
ambari-metrics
hdp-2.3.4
jmx
- APACHE ZEPPELIN ON HDP 2.4.2
How-To/Tutorial
Spark
notebook
zeppelin
zeppelin-notebook
- Pushing STIX/Taxii feeds from Opentaxii server into HBASE
How-To/Tutorial
Hbase
Metron
extensibility
how-to-tutorial
threat-intel
- Applying Threat Intel Feeds to Telemetry Events with Apache Metron
How-To/Tutorial
Metron
extensibility
how-to-tutorial
threat-intel
- Metron Extensibility: Adding a New Data Source to the Platform
How-To/Tutorial
Metron
extensibility
- Enriching Telemetry Events in Apache Metron.
How-To/Tutorial
Metron
enrichment
extensibility
faq
how-to-tutorial
- Collecting and Parsing Telemetry Events for new Data Source - WIP
How-To/Tutorial
Metron
extensibility
how-to-tutorial
- Ingesting JMS Messages to HDFS via HDF 2.0
How-To/Tutorial
HDFS
Nifi
data-ingestion
ingestion
jms
- HDFS Snapshots - 2) Operations
How-To/Tutorial
HDFS
backup
operations
snapshot
- HDF 2.0: Enable Ranger authorization for HDF components (Nifi, Kafka, Storm)
How-To/Tutorial
Nifi
Ranger
authorization
hdf
hdf-2.0.0
security
- SmartSense 1.3.0 Documentation Updates
documentation
faq
smartsense
- HAWQ - Using custom Kerberos Principal
Issue Resolution
authentication
data-ingestion
hawq
hdb
kerberos
pivotal
postgres
- Spark RDDs vs DataFrames vs SparkSQL
How-To/Tutorial
dataframe
pyspark
rdd
sparksql
- HDF 2.0: Use Ambari to enable kerberos for HDF cluster running Nifi, Kafka and Storm
How-To/Tutorial
Nifi
Ranger
hdf
hdf-2.0.0
kerberos
security
- Issue - Oozie web console is disabled.To enable Oozie web console install the Ext JS library
Issue Resolution
Oozie
extjs
faq
issue-resolution
oozie-ui
- Import hive metadata into Atlas
How-To/Tutorial
Atlas
governance
hiveserver2
metadata
- Knox with HttpFS
- Streaming Ingest of Google Sheets with HDF 2.0
How-To/Tutorial
Nifi
google
hdf-2.0.0
- Installing and Configuring Splice Machine for Hortonworks HDP
How-To/Tutorial
machine
rdbms
- Analyzing images in HDF 2.0 using TensorFlow
How-To/Tutorial
Nifi
hdf
tensorflow
- How to get Atlas up and running in HDP 2.5 Sandbox.
How-To/Tutorial
Atlas
Sandbox
hdp-2.5.0
- Ranger Audit in Hive Table - a sample approach
How-To/Tutorial
Ranger
- HDF 2.0 - Defining NiFi Policies in Ranger
How-To/Tutorial
Nifi
Ranger
hdf
hdf-2.0.0
policies
ranger-admin
- HDF 2.0 - Integrating Secured NiFi with Secured Ranger for Authorization Management
How-To/Tutorial
Nifi
Ranger
hdf-2.0.0
ranger-admin
- NiFi Multitenant Authorization when using Ranger Policies
How-To/Tutorial
Nifi
Ranger
authorization
security
- How to manage multiple copies of the HDP Docker Sandbox.
How-To/Tutorial
Sandbox
containers
docker
hdp-2.3.4
how-to-tutorial
- KAFKA MIRRORING IN HYBRID CLOUD ENVIRONMENT
How-To/Tutorial
Kafka
Spark
aws
- JMeter Setup for Hive Load Testing
- HDF 2.0: Use Ambari to enable kerberos for HDF cluster using Active Directory
How-To/Tutorial
Ambari
Nifi
Ranger
active-directory
hdf
hdf-2.0.0
kerberos
security
- Using Images Stored in HDFS for Web Pages
How-To/Tutorial
HDFS
applicatio
- Working with Variables in Hive (Hive Shell and Beeline),Hive Session Variables
Hive
beeline
cli
faq
hive-jdbc
hive-odbc
hive-views
hiveserver2
- Security and Governance Documentation Updates for HDP-2.5.0
Atlas
Knox
Ranger
faq
governance
kerberos
security
- How to Connect To Hive via Knox Using ODBC
How-To/Tutorial
Hive
installation
linux
odbc
pyodbc
- Ingesting EDI into HDFS using HDF 2.0
How-To/Tutorial
HDFS
Nifi
data-ingestion
ingestion
- Migrating Kafka partitions data to new disk location
How-To/Tutorial
Kafka
- Fixing Ambari-Kafka Alert Errors
- Demystify Knox, LDAP, SSL, CA Cert integration
How-To/Tutorial
Knox
how-to-tutorial
ldap
ssl
- exitCode=7 (Kerberos - YARN local-dirs with noexec mount option)
- Oozie Sqoop Hcatalog least known errors and solutions
Issue Resolution
Oozie
Spark
Sqoop
hcatalog
heartbeat
nullpointerexception
thrift
- Create dynamic row level filter in Ranger
How-To/Tutorial
Ranger
row-level-filtering
- Hive - Understanding concurrent sessions + queue allocation + preemption
How-To/Tutorial
Hive
Tez
YARN
preemption
- Digital Financial Advisor on Apache Spark and Apache Zeppelin
How-To/Tutorial
Spark
finance
zeppelin
- CSV to AVRO Conversion with NiFi Debugging, Checking Schemas
How-To/Tutorial
Kafka
Nifi
avro
csv
- HDF 2.0 Flow for Processing Real-Time Tweets
How-To/Tutorial
hdf
python
tensorflow
tweets
- Fixing RA040 errors in Ambari Views that refer to javax.net.ssl.SSLHandshakeException
Issue Resolution
ambari-server
ambari-views
security
- Apache Phoenix Performance Testing Tools
How-To/Tutorial
Phoenix
apache-phoenix
performance
- Understanding Taxonomy in Apache Atlas
How-To/Tutorial
Atlas
data-management
governance
taxonomy
- Hive data lineage using Apache Atlas
How-To/Tutorial
Atlas
Hive
data-lineage
governance
metadata
- Creating a Process Group for Twitter Data in NiFi
How-To/Tutorial
Nifi
Sandbox
elasticsearch
how-to-tutorial
twitter
- Let’s try a Kaggle Challenge with HDP !
How-To/Tutorial
Hive
Spark
faq
- Phoenix Index Lifecycle
Phoenix
apache-phoenix
faq
- Permission denied: user=yarn, access=WRITE oozie shell action
How-To/Tutorial
Oozie
oozie-shell-action
permission-denied
- Ambari API - Run all service checks (bulk)
Ambari
ambari-service
api
bulk-operations
how-to-tutorial
- 360° of an Oil & Gas well
How-To/Tutorial
Hive
Spark
datascience
demo
dtw
oilandgas
python
- HDP 2.5 Documentation Updates for Streaming Components Storm and Kafka
Kafka
Storm
documentation
faq
kafka-spout
storm-kafka
stream-processing
streaming
- HDP 2.5 Documentation Updates for Data Science Components: Spark, Zeppelin, and HDP Search
SOLR
Spark
documentation
faq
hdpsearch
spark-sql
spark-streaming
zeppelin
zeppelin-notebook
- Parsing evtx files with Apache NiFi
How-To/Tutorial
apache-nifi
hdf
logs
parsers
windows
- Configure SAP Vora HDP Ambari - Part 2
How-To/Tutorial
Hive
Spark
sap
sap-hana
vora
- Load Demo data in SAP Vora Using Eclipse HANA Modelling tools - Part 3
How-To/Tutorial
Hive
Spark
sap
sap-hana
vora
- Perform Data Analysis using SAP Vora on SAP Hana data - Part 4
How-To/Tutorial
Hive
Spark
sap
sap-hana
vora
- Running PySpark with Conda Env
How-To/Tutorial
Spark
conda
pyspark
- Kafka topic creation and ACL configuration for Atlas
Issue Resolution
Atlas
Kafka
governance
issue-resolution
- HowTo install and configure high availability on Atlas?
How-To/Tutorial
Ambari
Atlas
configuration
high-availability
- Multiple organisation ldap search support in Ranger Usersync
How-To/Tutorial
Ranger
ranger-usersync
- Save Spark DataFrame table into Phoenix
Spark
apache-phoenix
faq
- Restricting HiveCLI access to limited users
Hive
authentication
cli
configs
configuration
faq
hivecli
security
- Installing Ambari 2.2.2 (HDP 2.4.3) on Azure (RHEL 6.8)
How-To/Tutorial
Ambari
azure
hdp-2.4.3
rhel6
- Spark + S3A filesystem client from HDP to access S3
How-To/Tutorial
Spark
faq
pyspark
s3
spark-sql
- HDF 2.0 Secure 3 Node Development Cluster in Docker
How-To/Tutorial
Ambari
Nifi
docker
security
- How to start Atlas on Hortonworks Sandbox for HDP 2.5
How-To/Tutorial
Atlas
SOLR
Sandbox
ambari-infra
hdp-2.5.0
- New Hortonworks Data Cloud for AWS Technical Preview is now available!
How-To/Tutorial
hortonworks-cloud
- Offloading Mainframe Data into Hadoop
How-To/Tutorial
hadoop
mainframe
- Product Availability: Ambari 2.4.1 is released and available immediately for your evaluation and use (2016-09-21)
Ambari
faq
release
- Using NiFi GetTwitter, UpdateAttributes and ReplaceText processors to modify Twitter JSON data.
How-To/Tutorial
Nifi
elasticsearch
how-to-tutorial
regular-expressions
twitter
- Setting up a Hadoop/Spark cluster with Docker on a single machine
How-To/Tutorial
Ambari
cluster
docker
hadoop
- Installing and Configuring HBase Indexer in a Kerberized Cluster
How-To/Tutorial
Hbase
SOLR
index
indexer
indexing
solrcloud
streaming
- Tag Hive data using Apache Atlas
How-To/Tutorial
Atlas
Hive
governance
tag
- Oozie kerberized java action to query HiveServer2 using JDBC
How-To/Tutorial
hiveserver2
java
kerberos
oozie-hive
- Logical Disk Encryption - Data at Rest Encryption
HDFS
disk
encryption
faq
luks
- hadoop logs push to Kafka topic using Ambari LogSearch
How-To/Tutorial
Ambari
ambari-log-search
how-to-tutorial
- Replace Knox self-signed certificate with CA certificate
How-To/Tutorial
Knox
how-to-tutorial
knox-gateway
ldap
ssl
- Using Transparent Data Encryption in HDFS (Non-Kerberized cluster)
How-To/Tutorial
HDFS
Ranger
encryption
ranger-kms
- Supporting Custom Properties for Expression Language in NiFi
How-To/Tutorial
Nifi
expression-language
- Blueprint deployed NameNode HA failing on restarting NameNode during EU
Issue Resolution
Ambari
HDFS
ambari-blueprint
issue-resolution
namenode-ha
restart
upgrade
- Save Button Doesn’t Work in SOLR / Banana Dasbhoard
How-To/Tutorial
SOLR
banana
dashboard
- Scaling the HDFS NameNode (part 1)
How-To/Tutorial
HDFS
administration
namenode
scalability
- How to convert the HDP Sandbox into a Vagrant Box on Mac OS X
How-To/Tutorial
Sandbox
hdp-2.5.0
vagrant
virtualbox
- Rolling/Express Upgrade Pre-Checks – Purpose and Remediation
How-To/Tutorial
Ambari
rolling-upg
stack-upgrade
upgrade
- Change Data Capture using NiFi
How-To/Tutorial
Nifi
cdc
mysql
oracle
- Creating a Kibana dashboard of Twitter data pushed to Elasticsearch with NiFi
How-To/Tutorial
Nifi
dashboard
elasticsearch
kibana
- Secure Kafka Java Producer with Kerberos
How-To/Tutorial
Kafka
kerberos
security
- A Secure HDFS Client Example
How-To/Tutorial
HDFS
examples
kerberos
【一个安全HDFS客户端的例子】
- Nifi Log GeoEnrichment and Routing
How-To/Tutorial
Hive
Nifi
logs
security
- Reading Sensor Data from Remote Sensors on Raspberry Pis
How-To/Tutorial
Nifi
api
flask
minifi
mqtt
- Creating a Spring Boot Java 8 Microservice To Read Apache Phoenix Data
How-To/Tutorial
apache-phoenix
microservice
spring
- Enable HTTPS for Ambari using JKS
How-To/Tutorial
Ambari
operations
security
- How to pull data from Twitter and push data to Elasticsearch using NiFi.
How-To/Tutorial
Nifi
Sandbox
elasticsearch
twitter
- How to correctly fill out the Ambari Kerberos wizard (existing AD option)
Ambari
faq
kerberos
- Running Apache Beam Spark Runner on HDP 2.5
How-To/Tutorial
Spark
apache-beam
hdp-2.5
- Product Availability: 2016-09-03 HDP 2.4.3.0 has been released and is available immediately for your evaluation and use
- Configuring Ambari 2.2.2 Hive View with Isilon OneFS 8.0
How-To/Tutorial
hdp2.4
isilon
onefs
view
- Parameters for Multi-Homing
faq
network
- Enabling the Zeppelin Elasticsearch interpreter
How-To/Tutorial
elasticsearch
how-to-tutorial
interpreter
zeppelin
【在Zeppelin中启用Elasticsearch解释器】
- HDP 2.5 documentation updates for Data Movement
Falcon
Flume
Oozie
Sqoop
data-management
data-retention
documentation
faq
hdp-2.5
- Apache NiFi 1.0.0 Kerberos Authentication
How-To/Tutorial
Nifi
hdf
kerberos
security
- Installing Apache Ranger with Ambari Postgresql
How-To/Tutorial
Ambari
HDFS
Ranger
ambari-server
how-to-tutorial
installation
postgres
security
【中文】
- Apache Ranger and HDFS
HDFS
Ranger
how-to-tutorial
security
【Ranger和HDFS】
- Change default permission of hive database
How-To/Tutorial
hdfs-permissions
permission
- Ambari Rolling & Express Upgrade
Ambari
express-upgrade
faq
rolling-upgrade
- Cloudbreak SMTP Configuration v1.3
Cloudbreak
faq
- Using Sqoop to fetch many tables in parallel
How-To/Tutorial
Sqoop
how-to-tutorial
- Converting CSV To Avro with Apache NiFi
How-To/Tutorial
Nifi
apache
avro
csv
- HDF installation on EC2
How-To/Tutorial
Nifi
aws
cloud
faq
hdf
- How to Use Hortonworks Cloud to provision a cluster and experiment with Hive LLAP
How-To/Tutorial
Cloudbreak
Hive
aws
hortonworks-cloud
llap
- HDP upgrade using reinstallation
How-To/Tutorial
Ambari
hdp-2.3.4
upgrade
- Apache NiFi 1.0.0 - Zero-Master Clustering
How-To/Tutorial
Nifi
clustering
hdf
- NiFi 1.0.0 - Unsecured cluster setup
How-To/Tutorial
Nifi
hdf
- HBase Region Normalizer
Hbase
faq
regionsize
- Recommended Way to do HBase Prefix Scan through HBase Java API and HBase-Spark Connector
Hbase
Spark
faq
- Create A Restful API for Nifi - A Walmart Wrapper
How-To/Tutorial
Nifi
api
use-cases
- Oozie ssh action
How-To/Tutorial
Oozie
how-to-tutorial
ssh
- HDP 2.4.0 and Spark 1.6.0 connecting to AWS S3 buckets
How-To/Tutorial
Spark
aws
faq
s3
spark-shell
- How to install and run Spark 2.0 on HDP 2.5 Sandbox
Sandbox
Spark
faq
hdp-2.5
- Hive on Tez vs PySpark for weblogs parsing
How-To/Tutorial
Pig
Spark
Tez
pyspark
weblog
- More Hadoop nodes = faster IO and processing time?
How-To/Tutorial
MapReduce
aws
faq
iaas
performance
teragen
terasort
- Self Service Hadoop – well some starting points
Atlas
Spark
ambari-views
architecture
faq
- Ambari Views 2.4 New Features - Hue to Ambari Migration View
How-To/Tutorial
ambari-view
data-migration
hue
migration
new-feature
- Using the New HiveQL Processors in Apache NiFi 0.7.0
How-To/Tutorial
Hive
Nifi
hdf
- Enable HTTPS for YARN and MAPREDUCE2
How-To/Tutorial
Ambari
MapReduce
YARN
operations
security
- An introduction to Ambari Views 2.4 new feature- Remote cluster configuration
How-To/Tutorial
2.4
ambari-views
configuration
new-feature
- NiFi User Authentication with LDAP
Nifi
how-to-tutorial
ldap
nifi configuration
security
- Using Apache NiFi 1.0.0 with MongoDB
How-To/Tutorial
Nifi
apache-nifi
mongodb
- Performance of Spark on HDP/HDFS vs Spark on EMR
HDFS
Spark
emr
faq
pyspark
- Windows Share + Nifi + HDFS – A Practical Guide
How-To/Tutorial
Nifi
windows
- Auth-to-local Rules Syntax
How-To/Tutorial
auth-to-local
hadoopkerberosname
kerberos
ruleset
- Phoenix Timestamp - Leverage the core ROWKEY data model
How-To/Tutorial
Hive
Nifi
Phoenix
data-model
hadoop
timestamp
- Using Apache NiFi for Slowly Changing Dimensions on Hadoop Part 1
How-To/Tutorial
Hive
Nifi
Phoenix
hadoop
- What is new in HDP 2.5 HBase: enabling/disabling region splits and merges
How-To/Tutorial
Hbase
hdp-2.5
new-feature
- Phoenix Query Server with Microsoft .NET
.net
Phoenix
microsof
microsoft
- Accessing Facebook Page Data from Apache NiFi
How-To/Tutorial
Nifi
social
- Tableau on Spark Cache via ThriftServer
How-To/Tutorial
Spark
ingestion
odbc
spark-sql
sparksql
tableau
thrift
- Querying Data via SparkSQL with ODBC Tools
How-To/Tutorial
Spark
ingestion
odbc
sparksql
- How to install Apache Zeppelin, R, Solr, and Giraph on a ‘Spark’ HDInsights ‘Cluster Type’;
azure
best-practices
faq
hdi
hdinsight
- Apache Ignite “In-Memory Data Fabric”
HDFS
hadoop
how-to-tutorial
ignite
- Create Trait Types in Atlas
Atlas
- SparkSQL jdbc Federation
Spark
faq
federation
jdbc
sparksql
- HDP clients with multi-version and multi-OS support
How-To/Tutorial
hdp-2.3.4
manual_rpm_install
- Comparing all service configurations between clusters
How-To/Tutorial
Ambari
configuration
utilities
- Pig Doing Yoga: How to Build Superflexible Pig Scripts
How-To/Tutorial
Pig
script
- HBase compaction tuning tips
Hbase
compaction
faq
tip
- Oozie coordinator and based on input data events
How-To/Tutorial
Oozie
faq
oozie-coordinator
- Processing Social Media Feeds in Stream with Apache NiFi 1.0.0 and NLTK
How-To/Tutorial
Nifi
python
sentiment-analysis
- How to increase HDFS Balancer network bandwidth for faster movement
HDFS
balancer
command
faq
network
operations
- Solr Indexing the database tables :
How-To/Tutorial
SOLR
indexing
- How to copy encrypted data between two HDP clusters when Ranger KMS is utilized
How-To/Tutorial
distcp
encryption
ranger-kms
security
- EMC Isilon HDP 2.3 and Ambari 2.1 Installation Guide
How-To/Tutorial
Ambari
how-to-tutorial
isilon
- Managing Ambari Users and groups using Rest API
How-To/Tutorial
Ambari
how-to-tutorial
- Automatically deleting Ranger Users from MySQL and Postgres After Using REST API - HDP 2.4
Issue Resolution
Ranger
issue-resolution
mysql
postgres
ui
users
- HDF on HDI - NiFi
How-To/Tutorial
Nifi
hdf
hdi
- How do I login to Zeppelin when Security is enabled using HDP 2.5 Tech Preview Sandbox
How-To/Tutorial
Spark
authentication
security
zeppelin
- Apache Atlas as an Avro Schema Registry Test Drive
How-To/Tutorial
Atlas
Kafka
avro
schema-registry
- Avro Schema Registry with Apache Atlas for Streaming Data Management
How-To/Tutorial
Atlas
Kafka
avro
governance
schema-registry
- Simple Kafka Producer using Java in a Kerberozied cluster
How-To/Tutorial
Kafka
producer
- How to semi-automate deploying dev cluster
How-To/Tutorial
docker
hdp-2.3.4
installation
- Setup cross realm trust between two MIT KDC
How-To/Tutorial
distcp
kerberos
security
- Understanding Apache ZooKeeper Connection Rate Limiting
connections
faq
maxclientcnxns
zookeeper
- How to set up Grafana to use MySQL database rather than the default sqlite
How-To/Tutorial
configuration
grafana
mysql
- How to create and register custom ambari alerts ?
How-To/Tutorial
Ambari
ambari-alerts
faq
how-to-tutorial
- Using Pig to convert uncompressed data to compressed data in HDFS
How-To/Tutorial
HDFS
Hive
Pig
compression
- Apache NiFi 1.0.0-BETA: Using the New ListenSMTP for Mail Routing
How-To/Tutorial
apache-nifi
mail
- 5 Infrequently Known Commands To Debug Your HDFS Issues
How-To/Tutorial
HDFS
debug
tool
- Nifi 1.0.0 Beta UI Introduction
Nifi
apache-nifi
faq
nifi-templates
- Apache Pig IN operator, placeholder until PIG-4931 is closed
How-To/Tutorial
Pig
- Performance Comparison b/w ORC SNAPPY and ZLib in hive/ORC files.
Hive
faq
orc
performance
zlib
- What enhancements does the Apache Storm Release 1.0 bring for real-time streaming systems
Storm
faq
realtime
stream-processing
streaming
- Integrating Hortonworks DataFlow (powered by Apache NiFi) with SAS Event Stream Processing
How-To/Tutorial
Nifi
hdf
- Hadoop Data Node Density Tradeoff
architecture
data-nodes
faq
node-density
sizing
storage
- Demystifying Delegation Token
How-To/Tutorial
kerberos
security
【委派令牌解惑】
- Hive Streaming Compaction
How-To/Tutorial
Hive
- Implementing a real-time Hive Streaming example
How-To/Tutorial
Hive
- How to add sentiment analytics to Twitter/Apache Nifi Demo
How-To/Tutorial
Nifi
SOLR
apache-nifi
banana
how-to-tutorial
nifi-templates
sentiment-analysis
twitter
- Configure TEZ View for Kerberized HDP Cluster.
How-To/Tutorial
Tez
kerberos
view
- YARN Application Monitoring with NiFi
How-To/Tutorial
Nifi
YARN
operations
- Using NiFi to ingest and transform RSS feeds to HDFS using an external config file
How-To/Tutorial
HDFS
Nifi
data-ingestion
faq
groovy
http
nifi-processor
xml
- Mount VirtualBox Shared Folder
Sandbox
faq
share
virtualbox
vm
- Connected Platform Development and Maintenance Tips for Sandboxes
How-To/Tutorial
Ambari
Hive
Nifi
- Scaling the HDFS NameNode (part 3) - RPC scalability features
How-To/Tutorial
HDFS
administration
namenode
scalability
- Connecting Eclipse To Hive
Hive
development
eclipse
jdbc
- deleted /hdp/apps/ dir from hdfs
How-To/Tutorial
HDFS
delete
delete file
trash
- Application Timeline Server (ATS) issue error code: 500 , message: Internal Server Error
Issue Resolution
Nifi
Tez
YARN
issue-resolution
operations
yarn-ats
- Using Apache Flume Sources and Sinks with Apache NiFi 0.70
How-To/Tutorial
Flume
Nifi
faq
【中文】
- Finding Non-Numerics in a File - Pig Alternative
How-To/Tutorial
MapReduce
Pig
etl
- Hive UDFs vs Spatial SQL
How-To/Tutorial
Hive
esri
geospatial
hive-udf
spatial
- Geo-spatial Queries with Hive using ESRI Geometry and Spatial Framework for Hadoop
How-To/Tutorial
Hive
esri
geospatial
hive-udf
- Talks from Hadoop Summit San Jose 2016
faq
hadoop
- Scaling the HDFS NameNode (part 4) - Avoiding Performance Pitfalls
How-To/Tutorial
HDFS
administration
namenode
scalability
- HDP Upgrade : Hive Behavioral changes : .14 to 1.21 (HDP 2.1 to HDP 2.4)
Issue Resolution
Hive
bug
hdp-2.2
hdp-2.4.0
issue-resolution
upgrade
- templeton.libjars property changed its value after upgrade 2.2 to 2.3
- Horses for Courses: Apache Spark Streaming and Apache Nifi
Kafka
Nifi
Spark
faq
spark-streaming
- Analyze Small FIle in HDFS
How-To/Tutorial
HDFS
data
faq
- Small Files in Hadoop
data
faq
small files
- Explore the latest on Apache Hadoop HDFS Summit San Jose 2016
HDFS
faq
- Improving service startup times on OS X development machines
- Configure File view with namenode HA for Kerberized Cluster
How-To/Tutorial
Ambari
ambari-views
faq
kerberos
- Using ExtractMediaMetaData for Image Analysis
- Ambari and Chef. Will this combination be supported in the future?
Ambari
faq
- Why does the download fail when the HDP gsinstaller tries to download Oracle JDK?
faq
gsinstaller
x3rd_party
- Hive scripts used in Hive version 0.9 with the CAST function within a view statement no longer executes successfully in hive 0.11 if columns are not in single quotes.
Hive
issue-resolution
- Securing Solr Collections with Ranger + Kerberos
How-To/Tutorial
Ranger
SOLR
how-to-tutorial
kerberos
security
- Setup Hortonworks Data Platform using Vagrant, VirtualBox and Ambari
How-To/Tutorial
ambari-2.2.2
hdp-2.4.2
vagrant
virtualbox
- Heterogeneous Storage in HDFS(Part-1)…
HDFS
faq
hadoop
storage
- Apache Shiro design is intuitive and a simple way to ensure the safety of the application…
How-To/Tutorial
hadoop
security
shiro
- HBase Replication and comparison with popular online backup programs…
How-To/Tutorial
Hbase
hadoop
replication
- Workaround for SmartSense capture bundle timeout
Issue Resolution
hst
issue-resolution
smartsense
- HTTPFS - Configure and Run with HDP
HDFS
high-availability
httpfs
namenode
webhdfs
- Using Apache NiFi 0.7.0’s New PutSlack Processor
How-To/Tutorial
apache-nifi
nifi-processor
slack
twitter
- OpenHAB - IOT and you can set up
IOT
faq
- Using Teradata JDBC connector in NiFi
How-To/Tutorial
Nifi
database
nifi-processor
teradata
- Past and Future of Apache Kylin
How-To/Tutorial
hadoop
kylin
- Making a Hive UDF From A Useful Existing Library
How-To/Tutorial
Hive
hive-udf
java
udf
- NiFi Security: User Authentication with Kerberos
How-To/Tutorial
Nifi
authentication
how-to-tutorial
kerberos
ldap
security
- Running R program on HDP
analytics
data-science
how-to-tutorial
r
- Data Analysis Approach to a successful outcome…
How-To/Tutorial
analysis
data
data-model
- Hive Naming conventions and database naming standards…
Hive
database
faq
hadoop-ecosystem
- Rest call to ranger on wire encrypted cluster
Issue Resolution
Ranger
hadoop
issue-resolution
security
ssl
- Banana - URL to direct access custom dashboards
How-To/Tutorial
SOLR
banana
dashboard
- HDFS Balancer (2): Configurations & CLI Options
How-To/Tutorial
HDFS
balancer
faq
operations
- Assign IP to HDP 2.5 Sandbox on Virtualbox
How-To/Tutorial
hdp-2.5
ip
virtualbox
- Quickly Adding HDF to HDP 2.5 Sandbox
How-To/Tutorial
Nifi
Sandbox
hdf
hdp-2.5
- Using Kafka Manager with HDP 2.5 Sandbox Kafka
How-To/Tutorial
Kafka
ui
- Create Kafka Topic and Use From Apache NiFi for HDP 2.5 Sandbox
How-To/Tutorial
Kafka
Nifi
Sandbox
- Creating fat jars for Spark Kafka Streaming using sbt
How-To/Tutorial
Kafka
Spark
sbt
streaming
- Accessing Hive on HDP 2.5 Sandbox
How-To/Tutorial
Hive
hdp-2.5
- JRuby code to purge/query/filter data on Hbase over Hive table INT datatype are storing Binary format values at HBase…
How-To/Tutorial
Hbase
Hive
hadoop
jruby
- Scaling the HDFS NameNode (part 2)
How-To/Tutorial
HDFS
administration
namenode
scalability
- JSON to SQL using Spark
How-To/Tutorial
Hive
Spark
flatten
json
spark-sql
- Import HBase data in csv format using pig
How-To/Tutorial
Hbase
Pig
csv
etl
- HDFS Namenode Protection Checklist
How-To/Tutorial
namenode
- Hortonworks Data Platform Development Guide
development
faq
gradle
maven
repository
- Disaster recovery and Backup best practices in a typical Hadoop Cluster :Series 1 Introduction
backup
best-practices
disaster-recovery
faq
- Instrumenting user-defined metric to analysis your topology
How-To/Tutorial
Storm
- Generating Hive Query Metrics and more using “driven”: Part1 setup and installation
Hive
ambari-metrics
driven
faq
ui
- Disaster recovery and Backup best practices in a typical Hadoop Cluster: Series 2 Introduction to Tiered Storage
disaster-recovery
faq
storage
storagepolicies
- Excluding Duplicate Key Columns from Hive using Regular Expressions
How-To/Tutorial
Hive
sql
- How to connect NiFi with Kerberized HDP (Kafka and HDFS)
How-To/Tutorial
HDFS
Kafka
Nifi
kerberos
- Apache Chronos
How-To/Tutorial
chronos
faq
mesos
paas
- Marathon - PaaS to deploy applications
How-To/Tutorial
faq
marathon
mesos
paas
- Apache Mesos - Introduction
faq
mesos
paas
- IntelliJ / Eclipse Usage Against HDP 2.5 Sandbox
How-To/Tutorial
Sandbox
eclipse
intellij
- HAWQ/HDB and Hadoop with Hive and HBase
How-To/Tutorial
Hive
hawq
pivotal
pxf
- Druid - Part 2
druid
faq
- Rack Awareness
HDFS
YARN
faq
rack-awareness
- Druid - Part 1
How-To/Tutorial
druid
- Creating an HBase Coprocessor in Java
How-To/Tutorial
Hbase
Storm
java
- Spark on YARN - Executor Resource Allocation Optimization
How-To/Tutorial
Spark
YARN
faq
memory
- Install HDP 2.3 Cluster on Amazon EC2 using Ambari + Hue 3.9 Installation
Ambari
hdp2.3
how-to-tutorial
hue
installation
- How to create a Hive UDF in Scala
How-To/Tutorial
faq
hive-udf
how-to-tutorial
scala
udf
- Alluxio on HDP 2.4 - In Memory HDFS
How-To/Tutorial
HDFS
Spark
alluxio
- Hive insert query optimization
How-To/Tutorial
Hive
optimization
- Spark+Pycharm+Pybuilder on Docker
How-To/Tutorial
Spark
docker
- Zeppelin Ambari View not being displayed within Sandbox on Virtualbox.
Issue Resolution
Sandbox
ambari-views
zeppelin
- Using Solr’s Extracting Request Handler with Apache NiFi
How-To/Tutorial
Nifi
SOLR
- Hue psycopg2 issue
faq
hue
psycopg2
- Using Hive with PAM Authentication
- Using Apache Atlas to view Data Lineage
How-To/Tutorial
Atlas
Hive
data-lineage
- Hive ODBC Driver on OSX 10.11 (El Capitan)
Hive
odbc
osx
- Big Data Wrangling on HDP with Trifacta - How to Get started
How-To/Tutorial
Sandbox
data-wrangle
etl
trifacta
- Configuring Ranger Policy Administration High Availability
How-To/Tutorial
HDFS
Ranger
configuration
high-availability
how-to-tutorial
security
- Demystify Apache Tez Memory Tuning - Step by Step
Hive
Tez
how-to-tutorial
memory
performance
- Configure Pig View in Kerberized HDP Cluster.
How-To/Tutorial
Pig
ambari-views
【在HDP安全集群中配置Pig视图】
- Make mysql database as Hive’s Metastore:
How-To/Tutorial
Hive
metastore
mysql
- Changing the Log4j debug with
hadoop jar
in MapReduce jobs MapReduce
YARN
debug
faq
log4j
- Creating HIVE partitioned tables using sqoop
How-To/Tutorial
Hive
Sqoop
partition
partitioning
- Sqoop imports from oracle, informix and mysql:
Sqoop
export
faq
how-to-tutorial
mysql
postgres
- Getting familiar with HAWQs Command line Interface
faq
hawq
- Spinning up Hadoop HDP cluster on local machine using Vagrant
How-To/Tutorial
hadoop
linux
vagrant
virtualbox
- WebHCat and WebHDFS tutorial
hcatalog
how-to-tutorial
rest
webhcat
webhdfs
- Choosing Kerberos approach for Hadoop cluster in an enterprise environment
How-To/Tutorial
Ambari
kerberos
operations
security
- Simple example of Jenkins-HDP integration
How-To/Tutorial
automate
automation
benchmark
devops
- Removing the Hive MySQL component from Ambari
How-To/Tutorial
Ambari
Hive
ambari-server
mysql
- Making your cluster aware of multiple Namenode HA
How-To/Tutorial
namenode-ha
- How to troubleshoot Ambari Alerts Notification
Ambari
ambarialerts
faq
notifications
- Installing Lipstick on HDP 2.4
How-To/Tutorial
Pig
install
lipstick
- Order by Operator in Pig
How-To/Tutorial
Pig
- Sandbox - 127.0.0.1:8080 not accessible
Sandbox
connection-refused
issue-resolution
- Mapping Directories from Sandbox VM on VirtualBox to Mac OSX El Capitan
How-To/Tutorial
Sandbox
directories
virtualbox 5 sandbox
- Leveraging the upcoming HIVE 1.3 security UDFs today in HDP 2.3.x
Hive
hive-udf
how-to-tutorial
- Test Driven Development for Big Data (Unofficial Guide) Part 1
How-To/Tutorial
Hive
Pig
hadoop
java
- H100 Unable to submit statement show databases like ‘*’: org.apache.thrift.transport.TTransportException: java.net.SocketException: Broken pipe”
Hive
faq
h100
- HDP 2.3.2 sandbox - sqoop error stating “No statements may be issued when any streaming result sets are open..”
Hive
Sqoop
data-ingestion
faq
mysql
- Creating a Hive UDF in Java
How-To/Tutorial
Hive
java
udf
- Installing HAWQ on 2.4.0.0 Hortonworks Sandbox
faq
hawq
- Enabling Https for AmbariServer and troubleshooting secure communication
How-To/Tutorial
Ambari
ambari-server
security
ssl
- HAWQ Hierarchy basics
faq
hawq
- List Atlas Tags and Traits
How-To/Tutorial
Atlas
tag
- Sqooping Oracle Data simple steps
How-To/Tutorial
Sqoop
oracle
- Hortonworks Secure Cluster with Isilon OneFS
how-to-tutorial
isilon
kerberos
security
- Remote Debugging Ranger process
How-To/Tutorial
Ranger
remote-debug
- How to construct complex filter hierarchy in HBase Using Stargate REST API
How-To/Tutorial
Hbase
- Cheat Sheet and Tips for a Custom Install of Hortonworks Data Platform like a Pro
How-To/Tutorial
Ambari
configuration
installation
- Using Parsey McParseFace (Google TensorFlow Syntaxnet) From Apache NiFi (Draft)
How-To/Tutorial
Nifi
jq
tensorflow
- Troubleshooting tips for running job on secure hbase cluster
Hbase
faq
kerberos
security
- Ambari Admin Utility - Part 1
How-To/Tutorial
Ambari
faq
- Ranger is not allowing access to Knox services when groups are used to define policies.
Issue Resolution
Knox
Ranger
groups
issue-resolution
- Monitoring Kafka with Burrow - Part 1
How-To/Tutorial
Kafka
monitoring
- Tutorial: Install/Configure iPython and create/run PySpark Notebook
How-To/Tutorial
Ranger
Spark
ipython
- Enriching and Munging Twitter Data with HDF
How-To/Tutorial
Nifi
hdf
twitter
- Interacting with HDFS using native linux commands through an NFS Gateway
How-To/Tutorial
Ambari
HDFS
linux
nfs
- Upgrade instructions to HDP 2.3 with OneFS
hdp on isilon
isilon
onefs
upgrade
- Predicting Stock Portfolio Gains using Monte Carlo Simulation in Spark + Zeppelin
How-To/Tutorial
Spark
data-science
finance
spark-1.6.0
spark-sql
zeppelin
- Predicting stock portfolio losses using Monte Carlo simulation in Spark
How-To/Tutorial
hdp-2.4.0
spark-1.6.0
spark-sql
- Telecom DeviceManagerDemo
How-To/Tutorial
Hbase
Kafka
Nifi
Storm
- HDP services supporting RESTFul read, write and execute
Flume
Hbase
Hive
Sqoop
faq
- Visualize patients’ complaints to their doctors using NiFi and Solr/Banana
How-To/Tutorial
Nifi
SOLR
hl7
- Hadoop and LDAP: Usage, Load Patterns and Tuning
Issue Resolution
HDFS
hadoop
ldap
performance
- Change ambari alert threshold values for disks
ambari alerts
ambari-alerts
ambari-server
disk alert
how-to-tutorial
- Next Steps after deploying Hortonworks Data Platform Standard on Azure
azure
faq
hdp2.4
- Hive and LDAP integration
How-To/Tutorial
Hive
ldap
- Add/Remove external LDAP users to/from an internal group in RangerUI
Issue Resolution
HDFS
Ranger
how-to-tutorial
issue-resolution
process-groups
ranger-usersync
security
user-groups
- KMS gets 500 error when decrypting files when being accessed from another one way trust realm.
Issue Resolution
auth-to-local
issue-resolution
kms
ranger-kms
rules
- Tips on troubleshooting WebHbase connection through Knox.
How-To/Tutorial
Hbase
Knox
connection
tip
- Tips to set up knox topology lookup feature from ranger repository
How-To/Tutorial
Knox
Ranger
knox-gateway
ranger-admin
- Spark to parse Weblogs text files and write output to Parquet format
How-To/Tutorial
Spark
parquet
pyspark
zeppelin-notebook
- Accessing Kerberos enabled Kafka topics using GetKafka/PutKafka Processor
How-To/Tutorial
Nifi
kerberos
nifi-processor
- Debugging an Apache Storm topology
How-To/Tutorial
Storm
debug
- Apache Spark Performance Improvement on NUMA Capable Hardware
Spark
faq
performance
- Smartsense agent dying
Issue Resolution
agent
hst smartsense
smartsense
- [How-To] Resolving Ambari api error when executing GET call with Curl
Issue Resolution
Ambari
curl
get
issue-resolution
- Sample code to automate interacting with Zeppelin Interpreter APIs
How-To/Tutorial
Hive
hawq
interpreter
zeppelin
- How to connect to HBase 1.1 using Java
Hbase
Sandbox
code
hdp 2.3
java
- Write / Read Parquet File in Spark
Spark
faq
parquet
- Investigating Twitter Heron
How-To/Tutorial
Storm
heron
java
- How to solve Ambari Metrics corrupted data
Issue Resolution
Ambari
ambari-metrics
issue-resolution
- How to access HDFS Files using Spark through HA configuration in R
How-To/Tutorial
Spark
hdfs-ha
r
spark-sql
- Monitor Hadoop JVMs with jVisualVM
How-To/Tutorial
Hbase
garbage-collector
java
monitoring
- How to resolve CSRF protection error while adding service through Ambari api
How-To/Tutorial
Ambari
ambari-service
- Making it not rain with Apache NiFi
How-To/Tutorial
IOT
Nifi
raspberry
- Data Ingest with Apache Zeppelin + Apache Spark 1.6 + Hive
How-To/Tutorial
Spark
orc
spark-sql
zeppelin
- Moving the Ambari Default Database on Sandbox
Issue Resolution
Ambari
Sandbox
hdb
postgres
- Using GUI SQL Tools Against Hive on HDP from MacOSX
How-To/Tutorial
Hive
sql
tool
- Use Grafana with Ambari as a Data Source
Ambari
ams
faq
grafana
metrics
- Apache Zeppelin and SparkR
Spark
how-to-tutorial
sparkr
zeppelin
- HDB and Hadoop with Hive and HBase - Query federation
How-To/Tutorial
Hive
faq
hawq
hdb
- How to inspect SmartSense bundle contents
How-To/Tutorial
operations
security
smartsense
- What is Apache Marathon? Part 2
How-To/Tutorial
marathon
mesos
operations
- What is DC/OS? Part 1 (Unofficial)
How-To/Tutorial
mesos
operations
- Spark 1.6 Tips in Code and Submission
How-To/Tutorial
Spark
scala
- Ranger HDFS repository test connection getting Failed
Issue Resolution
HDFS
Ranger
repository
- Troubleshooting Kafka Upgrade
Issue Resolution
Kafka
issue-resolution
upgrade
- Read from HDFS in R script
How-To/Tutorial
data-science
r-hdfs
rhadoop
- Apache Metron Tech Preview 1 - Come and Get It!
How-To/Tutorial
Metron
tech-preview
- Apache Metron TP1 Blog Series
How-To/Tutorial
Metron
tech-preview
- Apache Chronos - Part 3
How-To/Tutorial
chronos
mesos
operations
- Monitoring Kafka with Burrow - Part 2
How-To/Tutorial
Kafka
cluster
monitoring
- How to Index PDF File with Flume and MorphlineSolrSink
How-To/Tutorial
Flume
SOLR
- How does Ambari-2.2.1.1 protect sensitive credentials and configurations
Ambari
configuration
faq
security
- Swappiness setting recommendation
faq
help
memory
operations
optimization
- Kafka MirrorMaker
How-To/Tutorial
Kafka
mirroring
replication
security
- How to modify ambari alert using POST/PUT action
How-To/Tutorial
Ambari
alerts
ambari alerts
post
- NiFi + Mac Dictation: Retrieving real-time quotes on voice commands
How-To/Tutorial
Nifi
hdf
http
speech
- How to Abort a hung ambari operation
Ambari
api
faq
hung
- Installing Hive ODBC with iODBC 3.52.10 on Mac OS X
- sqoop wrong password with password-file
Issue Resolution
Sqoop
password-file
- Apache Metron TP 1 Install Instructions- Single Node Vagrant Deployment
How-To/Tutorial
Metron
installation
tech-preview
- Zeppelinhub Viewer
How-To/Tutorial
Spark
faq
python
visualization
zeppelin
zeppelin-notebook
- Best Practices In HDFS Authorization with Apache Ranger
How-To/Tutorial
HDFS
Ranger
authorization
best-practices
how-to-tutorial
security
【中文】
- What if your Hadoop application get stuck
Issue Resolution
MapReduce
YARN
issue-resolution
yarn-scheduler
- Comparison of NiFi to Python for streaming application
Nifi
architecture
faq
python
streaming
- Hadoop as an Application PaaS with Slider and Docker
How-To/Tutorial
YARN
slider
zookeeper
- How to Write Storm HBase Bolt in a Kerberized HDP Cluster
How-To/Tutorial
Hbase
Storm
security
- Windowing and State checkpointing in Apache Storm
How-To/Tutorial
IOT
Storm
how-to-tutorial
realtime
stream-processing
- Enabling Falcon Authorization
How-To/Tutorial
Falcon
faq
user-groups
users
- How to simulate a Sales Executive with HDF
How-To/Tutorial
Nifi
hdf
- Apache Metron - First Steps in the Cloud
How-To/Tutorial
Metron
- Parsing XML Logs With Nifi – Part 1 of 3
How-To/Tutorial
Nifi
logs
xml
- A quick light-weight Sandbox environment with Apache Bigtop
How-To/Tutorial
bigtop
docker
operations
- Working with Ambari REST API - Automate NIFI Install and Config on Ambari
How-To/Tutorial
Ambari
Nifi
rest
- Virtual Integration of Hadoop with External Systems
faq
federation
integration
sparksql
virtualization
- Fixing broken tar.gz and jar files in HDP 2.4
Issue Resolution
MapReduce
Spark
hdp-2.4
issue-resolution
- Using Spark to Virtually Integrate Hadoop with External Systems
How-To/Tutorial
Hive
Spark
faq
federation
integration
spark-sql
sparksql
virtualization
zeppelin
- Geo Distance calculations in Hive and Java
How-To/Tutorial
Hive
geospatial
hdp-2.4.0
java
- Measuring HDP Performance, Scale and Reliability
performance
test
- Ambari Alerts Phantom or False Alerts on Kerberized cluster with Ambari 2.1.2
ambari-2.1.2
ambari-alerts
how-to-tutorial
phantom alerts
stale alerts
- How to Run Apache Metron Tech Preview in another Region / Availability Zone
How-To/Tutorial
Metron
aws
tech-preview
- Configuring HBase Replication
How-To/Tutorial
Hbase
copy
replication
- Deploy HDP 2.3.x cluster with Zeppelin 0.5.5 using Ambari blueprints
- How to change Ambari alert threshold values for disks
How-To/Tutorial
Ambari
ambari-alerts
disk
disk alert
- Running Spark in Production?
Spark
faq
sparkperformance
sparksecurity
- Syslog Forwarding to NiFi on your Mac
How-To/Tutorial
Nifi
listensyslog
mac
syslog
- How to configure HDF 1.2 to send to and get data from Kerberized Kafka in HDP.
How-To/Tutorial
Kafka
Nifi
hdf
kerberos
- Starting Spark jobs via REST API on a kerberized cluster
How-To/Tutorial
Knox
Spark
YARN
api
kerberos
- Starting Spark jobs directly via YARN REST API
How-To/Tutorial
Spark
YARN
api
- What is Snakebite ? and How to use it with HDP ?
How-To/Tutorial
HDFS
faq
snakebite
- Apache Drill (unofficial) - Introduction
Hbase
Hive
drill
how-to-tutorial
- Azure Sandbox HDP 2.3.2 is not allowing to log into ambari with user admin and password admin
Issue Resolution
Sandbox
azure
hdp-2.3.2
issue-resolution
login
- Apache Calcite - Introduction and Demo
How-To/Tutorial
calcite
- Apache Metron TP1 Deep Dive
How-To/Tutorial
Metron
tech-preview
- How QJM Works in Namenode HA
faq
journalnode
namenode
namenode-ha
- Apache NiFi - Part 2 (Twitter Flow)
Nifi
data flow nifi
data ingestion
hdf
how-to-tutorial
- Apache Metron User Personas and Why Metron?
How-To/Tutorial
Metron
tech-preview
user-personas
- RANGER Policies are not visible/disappeared after HDP Upgrade
Issue Resolution
Ranger
policies
ranger-admin
upgrade
- HTTPS Endpoint in NiFi Flow
How-To/Tutorial
Nifi
java
wire-encryption
- Ranger Upgrade fails with incorrect key file for table ‘xa_access_audit’
Issue Resolution
Ranger
issue-resolution
mysql
upgrade
xasecure
- Apache Metron vs. OpenSoc
Metron
OpenSoc
faq
- How to control size of log files for various HDP components?
faq
hdp 2.3
how-to-tutorial
log4j
logging
logs
- Azure Sandbox prep for Twitter/HDP/HDF demo
How-To/Tutorial
Nifi
SOLR
Sandbox
azure
banana
faq
hdf
twitter
- HiveServer2 JDBC Connection URL Examples
How-To/Tutorial
hiveserver2
how-to-tutorial
jdbc
- Installing Spark 1.6 on HDP 2.3.x
How-To/Tutorial
Spark
faq
hdp-2.3.0
hdp-2.3.2
hdp-2.3.4
- Pig Parameter Substitution with WebHCat and Hue
Pig
error
hue
webhcat
- Ranger Hive Repo Test Connection not working with HS2 HTTP TransportMode configuration
Issue Resolution
Ranger
connection
hiverserver2
repository
- Ambari experimental functionality
Ambari
ambari-service
how-to-tutorial
- Spark DataFrame to Solr Cloud - runs on Sandbox 2.3.2
SOLR
Spark
how-to-tutorial
solrcloud
sparksql
- NiFi + Graylog Integration
How-To/Tutorial
Nifi
logs
sql
- Change default user and password Cloudbreak from an AWS image deployment
Cloudbreak
configuration
faq
- EsgynDB based on Apache Trafodion and HDP 2.3 sandbox
How-To/Tutorial
HDFS
Hbase
Sandbox
trafodian
- Map Hive jobs to YARN queues
How-To/Tutorial
Hive
YARN
map
yarn-scheduler
- Pragmatic Kafka Security Setup 0.9, Java Producer Code
How-To/Tutorial
Kafka
Storm
authorization
deployment best practice
faq
security
ssl
- Trafodion + Zeppelin on HDP2.3 Sandbox:
How-To/Tutorial
Sandbox
hdp-2.3.0
zeppelin
zeppelin-notebook
- HDInsight Component Comparison to HDP
azure
faq
hdi
hdinsight
hdp-2.3.4
- How to check what version of Sandbox I downloaded?
how-to-tutorial
sandbox-version
- NiFi Hypothetical Disk Layout and RAID Configuration
How-To/Tutorial
Nifi
nifi configuration
raid
repo
- HDFS to Teradata - example
Sqoop
faq
terdata
- Cloudbreak 1.1.0 on Azure Deployment Tips
Cloudbreak
azure
faq
microsoft azure
- How to identify what is consuming space in HDFS
How-To/Tutorial
HDFS
disk
space
usage
- Install Apache Hawq on HDP 2.3.4
How-To/Tutorial
hawq
hdp-2.3.4
sql
- How to Increase HDP Sandbox Disk Space
Sandbox
space
- Running Hive in Oozie with Hive2Action and Password Files
- Disable SSLv3 for Hue v2.6.1
Issue Resolution
hue
ssl
sslv3
- Ambari Metric Server basic tuning
How-To/Tutorial
Ambari
ams
metrics
- Tutorial: A short primer on Scala - Issue Resolution
Spark
faq
issue-resolution
scala
tutorial
- Tutorial: Hands-on Tour of Apache Spark in 5 Minutes - Issue Resolution
Issue Resolution
Spark
hdp-2.3.2
issue-resolution
tutorial-360
- Using Ambari to check when upgrade / downgrade seems to be stuck
How-To/Tutorial
Ambari
downgrade
upgrade
- How to limit the size of ranger log and number of log files to retain?
Ranger
faq
how-to-tutorial
logs
- Getting started with Nifi expression language and custom Nifi processors on HDP sandbox
How-To/Tutorial
Hive
Nifi
SOLR
banana
expression-language
faq
hdf
how-to-tutorial
nifi-processor
twitter
- Ambari on EC2
Ambari
aws
ec2
faq
- HDFS Permission Checks
HDFS
documentation
faq
hdfs-permissions
security
- New Visualization Feature in Hive View
ambari-2.1.2
documentation
hive view
visualization
- Centrify Integration With HDP
authentication
centrify
faq
hdp
security
- Yarn ATS 1.5 requires Tez Client be installed on node
Issue Resolution
Tez
YARN
ats
- Storm Serialization with Avro (using Kryo Serializer)
How-To/Tutorial
Storm
avro
- My First Express Upgrade
How-To/Tutorial
Ambari
upgrade
- Comparison of HttpFs and WebHDFS
faq
httpfs
webhdfs
- Hive ACID - Current state
Hive
faq
- Access Kerberos cluster from JAVA using cached ticket
How-To/Tutorial
faq
java
kerberos
windows
- Kerberos: The Missing Guide
faq
java
kerberos
operations
security
- Visualize near-real-time stock price changes using Solr and Banana UI
Nifi
SOLR
banana
faq
how-to-tutorial
- Priority of a Hadoop job
How-To/Tutorial
hadoop
jobs
- How to assign capacity scheduler queue based on AD group?
How-To/Tutorial
YARN
faq
queue
yarn-scheduler
- Shell action in oozie workflow via Hue
Oozie
faq
hue
oozie-shell-action
oozie-shell-hue
- WARNING: setpgid(31734, 0) failed - [Errno 13] Permission denied
How-To/Tutorial
31734
ambari server
- Notes on Big Data Governance
Atlas
Falcon
faq
waterline
- HDFS Data Durability and Availability with replication = 3
HDFS
faq
replication
- Ranger delete process and Hive errors
How-To/Tutorial
Ranger
- How-To: Cleanup SolrCloud entries in ZooKeeper
SOLR
best-practices
how-to-tutorial
solrcloud
zookeeper
- NiFi - Understanding how to use Process Groups and Remote Process Groups.
How-To/Tutorial
Nifi
hdf
process-groups
- Ranger SSL - pitfalls
How-To/Tutorial
Ranger
https
ranger-0.4.0
ssl
- Solutions for Storm Nimbus Failure
Storm
issue-resolution
nimbus
- HiveWebInterface - Example
How-To/Tutorial
Hive
http
- Yarn Queue Utilization - Ambari Widget
How-To/Tutorial
YARN
ambari-metrics
- Write or Append failures in very small Clusters, under heavy load or crash testing
Issue Resolution
append
issue-resolution
replicanotfoundexception
- Is there a way to convert locally managed table to external table?
How-To/Tutorial
Hive
- How to Move or Change HDFS DataNode Directories
HDFS
- NodeManager Web UI connection timeouts; always 5 seconds
Issue Resolution
HDFS
YARN
nodemanager
- Geospatial Data Analysis in Hadoop
Hive
esri
geospatial
gis
how-to-tutorial
- Security in “Enterprise Ready” Data Lake
Atlas
Ranger
faq
governance
security
- How To Best Resolve - RMStateStore FENCED?
Issue Resolution
MapReduce
rmapproot
- OSquery - Tool to troubleshoot OS processes
how-to-tutorial
osquery
troubleshooting
- OLAP in Hadoop - Introduction ( Part 1 )
atscale
druid
faq
kylin
olap
- Error accessing Apache Zeppelin Notebook tab
Issue Resolution
firewall
zeppelin
zeppelin-notebook
- Best Practices: Linux File Systems for HDFS
HDFS
best-practices
faq
filesystem
linux
- Impact of HDFS using Cloudbreak Scale Down
Cloudbreak
HDFS
faq
scale
storage
- Access HDFS file extended attributes in Hive with Groovy UDF
HDFS
Hive
faq
groovy
how-to-tutorial
metadata
- HDP 2.2 Sandbox - How To Fix the error JA002: Unauthorized connection for super-user: oozie from IP 127.0.0.1
Oozie
Sandbox
how-to-tutorial
- Getting started with Hortonworks Sandbox on Azure
- HDInsight Deployment Best Practices
azure
best-practices
cloud
hdinsight
- ERROR : Error: Could not find or load main class org.apache.ambari.server.DBConnectionVerification
Issue Resolution
Ambari
ambari-service
faq
issue-resolution
upgrade
- Accelerating Streaming Analytics with Spark and HDF
Nifi
Spark
faq
how-to-tutorial
- measuring network latency between nodes
how-to-tutorial
iperf
latency
network
- Apache Ranger and HBase
Hbase
Knox
Ranger
hdp-2.3.0
how-to-tutorial
- A quick skinny on Apache Kylin
faq
kylin
olap
- Getting assistance for Sandbox issues in the Sandbox Forums
Sandbox
help
how-to-tutorial
- GoHadoop
YARN
- Apache Nifi (aka HDF) data flow across data center
Nifi
faq
hdf
how-to-tutorial
- Hive OOM - Caused by: java.lang.OutOfMemoryError: Java heap space
Issue Resolution
Hive
heap
issue-resolution
- Yarn Node Labels
YARN
how-to-tutorial
node label
yarn node labels
yarn-cluster
yarn-node-labels
- HPs BDRA - What is different from Traditional Hadoop Architectures?
architecture
faq
hp
- Kerberos Ticket Error - Cache in IPA /RedHat IDM (KEYRING) SOLVED!!
issue-resolution
kerberos ipa keyring
- Yarn queues and CS view - Queue Mapping
YARN
capacity scheduler
capacity scheduler queue
how-to-tutorial
- Yarn queues - No Capacity Scheduler view
YARN
capacity scheduler
capacity scheduler queue
how-to-tutorial
- Apache Ranger and Yarn setup - Security
Ranger
YARN
how-to-tutorial
security
- Introduction to Presto
faq
presto
- What does the “Skip group modifications” option actually do on a new Ambari Install
Ambari
faq
installation
- HDP deployment in (Azure) using CloudBreak
Cloudbreak
azure
cloud
hdp
how-to-tutorial
- Apache Zeppelin Walk Through
Spark
faq
notebook
zeppelin
zeppelin-notebook
- NiFi/HDF Dataflow Optimization (Part 1 of 2)
Nifi
dataflow
hdf
how-to-tutorial
- NiFi/HDF Dataflow Optimization (Part 2 of 2)
Nifi
dataflow
hdf
how-to-tutorial
- Kafka Benchmarking
Kafka
Storm
benchmark
deployment best practice
faq
how-to-tutorial
performance
- A Collection of NiFi Examples
Nifi
examples
faq
hdf
- How to Migrate a Standalone NiFi into a NiFI Cluster
Nifi
cluster
hdf
how-to-tutorial
migration
- Troubleshooting an Oozie Flow
Oozie
how-to-tutorial
troubleshooting
- Hive and Google Cloud Storage
Hive
cloud
gce
google
how-to-tutorial
- HDP with Isilon: Certified and ready for any Hadoop workload
certification
faq
isilon
storage
- Apache NiFi & May the Force be with you
Nifi
SOLR
data flow nifi
data ingestion
hdf
how-to-tutorial
- Apache Hive Groovy UDF examples
Hive
groovy
hive-udf
how-to-tutorial
- Apache Zeppelin (Hive & Spark Demo)
Spark
how-to-tutorial
sparksql
zeppelin-notebook
- Apache Hive CSV SerDe Example
Hive
hadoop
how-to-tutorial
serde
- Hiveserver alert after enabling Kerberos in HDP2.2.3/2.2.4
Ambari
Hive
hiveserver2
issue-resolution
kerberos
- Hidden Gem in HDP sandbox. SSH Web Server on port 4200
Sandbox
faq
ssh
- Best Practice: ‘chroot’ your Solr Cloud in ZooKeeper
SOLR
best-practices
faq
solrcloud
zookeeper
- How to open HCC/AH as an Ambari view
ah
ambari views
how-to-tutorial
- Should you restrict ports within a cluster with a firewall? What are ports that Hadoop uses?
faq
firewall
- Create a Hive Script to Validate Tables
Hive
how-to-tutorial
- Interesting talks for Hadoop Summit 2016 (EMEA)
Spark
Storm
data-science
faq
hadoopsummit
- Working with firewalled HDP via SSH Tunnel and SOCKS Proxy
Hive
how-to-tutorial
jdbc
odbc
security
- Ambari with Postgres HA
Ambari
faq
postgres
- Configuring HDP Security with Active Directory/IPA
Knox
Ranger
active-directory
faq
ipa
ldap
- How To Decrypt OpenSSL-encrypted Data In Apache NiFi
Nifi
aes
encryption
issue-resolution
openssl
- DISTCP errors when copying to AWS in HDP 2.3.x
HDFS
aws
distcp
faq
- The Hadoop Ecosystem Table
ecosystem
faq
hadoop
hadoop-ecosystem
- Ambari LDAP sync
- Quick Presto history
faq
presto ansi sql hadoop
- Use OpenTSDB to store/visualize stock data on HDP sandbox
Hbase
ambari-service
how-to-tutorial
opentsdb
stock data
- SQL Based authorization in hive
- Java Client connecting to Secure Cluster in Non-Default Realm, or two Secure Clusters in Different Realms
Hive
how-to-tutorial
kerberos
realms
security
- MergeContent Processor Inner Workings
Nifi
dataflow
hdf
mergecontent
- Strict Access to Encrypted Zone in Transparent Data Encryption
encryption
security
tde
transparent data encryption
wire-encryption
- Hive CLI Security
Hive
authorization
security
- LinuxContainerExecutor Security Best Practices
YARN
best-practices
security
- Best Practices for using HCC
comments
community
feedback
forum
replies
- Azure (Linux VM) & HDP Demo
azure
azure hadoop
iaas
microsoft azure
wasb
- How to Search for Text in an Image
- Getting started with SQLStdAuth
Hive
authorization
sqlstdauth
- Set dfs.namenode.accesstime.precision from Ambari in HDP-2.2
Ambari
ambari-2.1.0
ambari-2.1.1
hdp
- Sample Application to write to a Kerberised HBase
Hbase
java
kerberos
- Adding a Service RPC Port to an Existing HA Cluster with ZKFCs
- Use WASB as HDP 2.3.2 File System
- Tip: Bend those Connections
Nifi
dataflow
hdf
tip
- Tip: Templates with Dependent Controller Services
Nifi
Phoenix
dataflow
hdf
sql
- Tips on Storage Options for HDP on Amazon Web Services
aws
best-practices
- SSH Issue with a Private Key Asking for Password
ssh
- Configuring YARN Capacity Scheduler with Ambari
- Unable to delete STORM REST API service component after upgrade
Storm
ambari-2.1.1
hdp-2.2.0
upgrade
upgrades
- Moving Oozie to MySQL with Ambari
Oozie