Data Lake 3.0: The EZ button to deploy in minutes and cut TCO by half
The new year brings new innovation and collaborative efforts. Various teams from the Apache community have been working hard for the last eighteen months to bring the EZ button to Apache Hadoop...
View ArticleData Lake 3.0 Part 2 – A Multi-Colored YARN
Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we briefly introduced the power of leveraging prepackaged applications in Data Lake 3.0 and how the focus will shift from the...
View ArticleDatalake 3.0 Part 3 – Distributed TensorFlow Assembly on Apache Hadoop YARN
Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we introduced what a Data Lake 3.0 is and in part 2 of the series, we talked about how a multi-colored YARN will play a critical...
View ArticleDetecting Hackers and Impersonators with Machine Learning
The 2014 Yahoo email hack is a good illustration how a big data security analytics platform such as Apache Metron can make it easier to detect, investigate, assess, and remediate threats in your...
View ArticleMachine Learning: A new frontier
Large-scale Machine Learning The ability to learn without being explicitly programmed, Machine Learning, has been around for a long time and is well understood. What is different is the relatively...
View ArticleFour Trends in Artificial Intelligence That Affect Enterprises
Andrew Ng, the renowned chief data scientist, has said that artificial intelligence (AI) needs to be a company-wide strategic decision. Companies that don’t strategically invest in AI will slowly lose...
View ArticleIntegrate SparkR and R for Better Data Science Workflow
R is one of the primary programming languages for data science with more than 10,000 packages. R is an open source software that is widely taught in colleges and universities as part of statistics and...
View ArticleHCC- Top 5 Technical Articles from Last Week
It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC Supporting Custom...
View ArticleHCC Top Posts and Articles
It has been another exciting week on Hortonworks Community Connection HCC. We continue to see great activity and recommend the following assets from last week. Top Articles from HCC One Way Trust – MIT...
View ArticleIndustry Trends and Apache Spark’s Evolving Role in the Big Data Landscape
Apache Spark has been Open Source’s new kid on the block. Companies are using Spark to develop sophisticated models that would enable them to discover new opportunities or avoid risk. But what does the...
View ArticleEnterprise NiFi: Implementing Reusable Components and a Software Development...
Originally posted in HCC 1. Introduction NiFi is a powerful and easy to use technology to build dataflows from diverse sources to diverse targets while transforming and dynamically routing in between....
View Article10 Questions on Hortonworks Data Cloud for AWS
We recently concluded our highly attended How to Get Started with Hortonworks Data Cloud for AWS Webinars. Thank you Jeff Sposetti and Sean Roberts for hosting the sessions. The webinars provided a...
View ArticleTry Apache Spark 2.1 & Zeppelin in Hortonworks Data Cloud
Apache Spark 2.1 was released recently in the community. The main focus of this release was improvements in Structured Streaming and Machine Learning. Structured Streaming: Kafka .10 support, Metrics...
View ArticleWelcome to Apache Zeppelin 0.7.0
We are very excited about the release of Apache Zeppelin 0.7.0 and want to thank the Apache Foundation along with the Apache Zeppelin community. The long awaited release introduces several key features...
View ArticleCISO’s View: Metrics as the Foundation – Part 1
Welcome back to my blog series, the CISO’s View. In my last article, CISO’s View: Why an integrated approach matters, I stirred up the waters a bit by stating that the CISO’s first and most...
View ArticleCISO’s View: Metrics as the Foundation – Part 2
Welcome back to my blog series, the CISO’s View. In my last article, CISO’s View: metrics part 1, we started looking at metrics and why they are the foundation of a successful security program. Today,...
View ArticleData Lake 3.0: The EZ button to deploy in minutes and cut TCO by half
The new year brings new innovation and collaborative efforts. Various teams from the Apache community have been working hard for the last eighteen months to bring the EZ button to Apache Hadoop...
View ArticleData Lake 3.0 Part 2 – A Multi-Colored YARN
Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we briefly introduced the power of leveraging prepackaged applications in Data Lake 3.0 and how the focus will shift from the...
View ArticleDatalake 3.0 Part 3 – Distributed TensorFlow Assembly on Apache Hadoop YARN
Thank you for reading our Data Lake 3.0 series! In part 1 of the series, we introduced what a Data Lake 3.0 is and in part 2 of the series, we talked about how a multi-colored YARN will play a critical...
View ArticleDetecting Hackers and Impersonators with Machine Learning
The 2014 Yahoo email hack is a good illustration how a big data security analytics platform such as Apache Metron can make it easier to detect, investigate, assess, and remediate threats in your...
View Article