Meet the Committer: 3 Minutes on Apache Ambari with Mahadev Konar
We’re continuing our series of quick interviews with Apache Hadoop project committers at Hortonworks. This week Mahadev Konar discusses Apache Ambari, the open source Apache project to simplify...
View ArticleWelcoming Julian Hyde to Hortonworks
I’d like to take a quick moment to welcome Julian Hyde as the latest addition to the Hortonworks engineering team. Julian has a long history of working on data platforms, including development of SQL...
View ArticleDelivering on Stinger: a Phase 3 Progress Update
With the attention of the Hadoop community on Strata/Hadoop World in New York this week, it’s seems an appropriate time to give everyone an early update on continued community development of Apache...
View ArticleApache Knox Gateway 0.3.0: Another release of perimeter security for Hadoop
The Apache Knox community announced the release of the Apache Knox Gateway (Incubator) 0.3.0. We, at Hortonworks, are excited about this announcement. The Apache Knox Gateway is a REST API Gateway for...
View ArticleOpenStack: why it’s so great to see HDP in Rackspace cloud
One of the great things about working in open source development is working with other experts round the work on big projects – and then having the results of that work in the hands of users within a...
View ArticleIntroducing Tez Sessions
This post is the seventh in our series on the motivations, architecture and performance gains of Apache Tez for data processing in Hadoop. The series has the following posts: Apache Tez: A New Chapter...
View ArticleResource Localization in YARN: Deep Dive
This post is authored by Omkar Vinit Joshi with Vinod Kumar Vavilapalli and is the ninth post in the multi-part blog series on Apache Hadoop YARN – a general-purpose, distributed, application...
View ArticleUsing Hive to interact with HBase, Part 1
This is the first of two posts examining the use of Hive for interaction with HBase tables. The second post is here. One of the things I’m frequently asked about is how to use HBase from Apache Hive....
View ArticleUsing Hive to interact with HBase, Part 2
This is the second of two posts examining the use of Hive for interaction with HBase tables. This is a hands-on exploration so the first post isn’t required reading for consuming this one. Still, it...
View ArticleSimplifying user-logs management and access in YARN
User logs of Hadoop jobs serve multiple purposes. First and foremost, they can be used to debug issues while running a MapReduce application – correctness problems with the application itself, race...
View ArticleHortonworks Data Platform 2.0 Certified for Ubuntu
In just a few years, interest in Hadoop has enjoyed a meteoric rise. It is everywhere… and it should be available everywhere. Here at Hortonworks we have worked to provide the widest range of...
View ArticleA Roadmap for Hadoop and OpenStack Integration
A recent survey conducted by the OpenStack foundation shows incredible adoption in the enterprise. Cost savings and operational efficiency stand out as the top business motivators that are driving...
View ArticleApache Falcon Technical Preview Available Now
We believe the fastest path to innovation is the open community and we work hard to help deliver this innovation from the community to the enterprise. However, this is a two way street. We are also...
View ArticleApache Ambari graduates to Apache Top Level Project!
We are very excited to announce that Apache Ambari has graduated out of Incubator and is now an Apache Top Level Project! Hortonworks introduced Ambari as an Apache Incubator project back in August...
View ArticleApache Tez 0.2.0 Released
The Apache Tez team is proud to announce the first release of Apache Tez – version 0.2.0-incubating. Apache Tez is an application framework which allows for a complex directed-acyclic-graph of tasks...
View ArticleHadoop Security : Today and Tomorrow
Security is a top agenda item and represents critical requirements for Hadoop projects. Over the years, Hadoop has evolved to address key concerns regarding authentication, authorization, accounting,...
View ArticleHow To Secure Apache Sqoop Jobs with Oracle Wallet
Apache Sqoop is a tool that transfers data between the Hadoop ecosystem and enterprise data stores. Sqoop does this by providing methods to transfer data to HDFS or Hive (using HCatalog). Oracle...
View ArticleHortonworks Data Platform 2.0 on openjdk
Apache Hadoop has always been very fussy about Java versions. It’s a big application running across tens of thousands of processes across thousands of machines in a single datacenter. This makes it...
View ArticleDownloads for Storm, Falcon, Knox Gateway and Tez
Last week was a busy week for shipping code, so here’s a quick recap on the new stuff to keep you busy over the holiday season. Technical Preview of Storm. This preview includes the latest release of...
View ArticleWire Encryption in Hadoop
Encryption is applied to electronic information in order to ensure its privacy and confidentiality. Typically, we think of protecting data as it rests or in motion. Wire Encryption protects the...
View Article