Apache Hadoop Operations at Scale
Hadoop Summit Content Curation Although the Hadoop Summit San Jose 2014 has come and gone, the invaluable content—keynotes, sessions, and tracks—is available here. I’ve selected a few sessions below...
View ArticleEnabling Kerberos on HDP and Integrating with Active Directory
Hadoop is a business-critical data platform at many of the world’s largest enterprises. These corporations require a layered security model focusing on four aspects of security: authentication,...
View ArticleFour Steps Strategy for Incremental Updates in Apache Hive on Hadoop
Incremental Updates Hadoop and Hive are quickly evolving to outgrow previous limitations for integration and data access. On the near-term development roadmap, we expect to see Hive supporting full...
View ArticleApache Hadoop YARN Ready Webinars
As part of our YARN Ready program, we are hosting a series of technical webinars highlighting the technologies and resources available to developers for creating YARN applications. In our first...
View ArticleAnnouncing Apache Pig 0.13.0
The Apache Pig community released Pig 0.13. earlier this month. Pig uses a simple scripting language to perform complex transformations on data stored in Apache Hadoop. The Pig community has been...
View ArticleApache Tez Graduates to Top-Level
Last week, Apache Tez graduated to become a top level project within the Apache Software Foundation (ASF). This represents a major step forward for the project and is representative of its momentum...
View ArticleApache Hadoop YARN: Present and Future
Hadoop Summit Content Curation Although the Hadoop Summit San Jose 2014 has come and gone, the invaluable content—keynotes, sessions, and tracks—is available here. We ’ve selected a few sessions for...
View ArticleApache Ambari 1.6.1 Released
Earlier this month, the Apache Ambari community released Apache Ambari 1.6.1, which includes multiple improvements for performance and usability. The momentum in and around the Ambari community is...
View ArticleIs Your Hadoop Cluster Bursting at the Seams? Use Apache Ambari to Expand It
Apache Hadoop clusters grow and change with use. Maybe you used Apache Ambari to build your initial cluster with a base set of Hadoop services targeting known use cases and now you want to add other...
View ArticleDeploy Apache Ambari on HDP Clusters with StackIQ
StackIQ, a Hortonworks technology partner, offers a comprehensive software suite that automates the deployment, provisioning, and management of Big Infrastructure. In this guest blog, Anoop Rajendra...
View ArticleSearching for the Apache Hadoop Provisioning Swiss Army Knife
SequenceIQ provides an API and platform to build predictive applications and turn data into tangible assets. In this guest blog, SequenceIQ Co-founder and CTO Janos Matyas (@sequenceiq), explains why...
View ArticleOnboarding Long Running Services to Apache Hadoop YARN Using Apache Slider
Apache Hadoop has come along a long way. From its early days as a platform to index the web, it has evolved to its current interactive, real-time, and batch processing capabilities spanning gigabytes...
View ArticleThe Future of Apache Ambari
It’s been a busy year for Apache Ambari. Keeping up with the rapid innovation in the open community certainly is exciting. We’ve already seen six releases this year to maintain a steady drumbeat of new...
View ArticleHadoop Summit Curated Content: Apache Hadoop Security
“Data is to information society what fuel was to the industrial economy: the critical resource powering the innovations that people rely on,” write Victor Mayer-Schönberger and Kenneth Cukier, in Big...
View ArticleResilience of Apache Hadoop YARN Applications across ResourceManager Restart...
Hortonworks Software Engineers Vinod Kumar Vavilapalli (Apache Hadoop YARN committer) and Jian He (Apache YARN Hadoop committer) discuss Apache Hadoop YARN’s Resource Manager resiliency upon restart in...
View ArticleAnnouncing Apache Argus: A Clarion Call
In May, Hortonworks acquired XA Secure and made a promise to contribute this technology to the Apache Software Foundation. In June, we made it available for all to download and use from our website...
View ArticleSecure JDBC and ODBC Clients’ Access to HiveServer2
Introduction HDP 2.1 ships with Apache Knox 0.4.0. This release of Apache Knox supports WebHDFS, WebHCAT, Oozie, Hive, and HBase REST APIs. Hive is a popular component used for SQL access to Hadoop,...
View ArticleContinued Innovation in Hadoop Security
We are in the midst of a data revolution. Hadoop, powered by Apache Hadoop YARN, enables enterprises to store, process, and innovate around data at a scale never seen before making security a critical...
View ArticleBuild High Performance Data Processing Application Using Apache Tez
This week we continue our YARN webinar series with detailed introduction and a developer overview of Apache Tez. Designed to express fit-to-purpose data processing logic, Tez enables batch and...
View ArticleEvolving Apache Hadoop YARN to Provide Resource and Workload Management for...
The Journey Almost to the date, two years ago the Apache Hadoop community voted to make YARN a sub-project of Apache Hadoop followed by the GA release nearly a year ago last fall. Since then, it’s...
View Article