Workload Automation Blog

A New Tool for the Times as Oozie is Past Its Prime

3 minute read
Basil Faruqui

 

If you need to hang a picture or replace a few roof shingles, the first thing you’ll probably reach for is a hammer, which has been used for centuries and has a future as secure as any tool in the toolbox. But if you need dental work, you don’t want your dentist using old technology like a hammer and pliers. In fact you’ll insist that your dentist use tools and techniques that have evolved from hundreds of years ago.

Most decisions between using tried-and-true vs. innovative-and-new are not so easy, a fact I was reminded of by reading this intriguing InfoWorld blog 7 Big Data Tools to Ditch in 2017. Because big data is changing and maturing so quickly, its underlying tools and technologies can quickly become out of date. Big data teams are so busy with projects that it’s hard for them to keep up with the latest developments in the ecosystem. We at BMC hear this a lot from our customers, and we can empathize because we’re doing a lot of big data development too.

Because many of the tools for the Hadoop ecosystem have been developed in open source, they have some inconsistencies and limitations that don’t get addressed as the open source community turns its attention to developing the next set of tools to solve emerging problems. Enterprises are left to use these tools until something better comes along.

Now the good news: Something better has come along for working with Hadoop workflows. Most enterprises first tried Oozie for Hadoop workflow management. Now Oozie has earned a target as one of the top tools to ditch in 2017, because as the InfoWorld blog noted:

I’ve long hated on Oozie. It isn’t much of a workflow engine or much of a scheduler – yet it’s both and neither at the same time! It is, however, a collection of bugs for a piece of software that shouldn’t be that hard to write.”

Our customers have specifically been telling us they want a better alternative than Oozie for creating, scheduling and managing Hadoop workflows. We believe we’ve delivered the superior alternative with Control-M for Hadoop, which automates big data batch processes and enables them to be developed, scheduled, managed and monitored with all other enterprise workloads in a single solution. It takes the complexity out of automating and scheduling big data workflows, which leads to faster implementation and more accurate results.

Our approach was validated in recent third-party testing that found Hadoop workflows could be developed 40 percent faster using Control-M for Hadoop instead of Oozie and other open source tools. More importantly, real-world customers like Navistar, a leading manufacturer of commercial vehicles that uses big data to improve vehicle uptime, have documented the value of our approach. Here is an excerpt on the subject:

“Prior to bringing Control-M into our Hadoop environment, we had two engineers working full time pulling this data from AWS, aggregating it, and putting it into a spreadsheet for the design engineers. It took about a week to get the data into a usable form. Control-M now handles these tasks automatically, freeing up those two engineers to take on strategic tasks. In addition, the design engineers now have real-time access to data that’s viewable on easy-to-read dashboards, a marked improvement over static reports based on data that’s a week old.”

Not only is Control-M faster than Oozie, it covers and automates many more of the tasks required to develop, test, promote, deploy, schedule, manage and secure workflows. Our customers told us what their Big Data development challenges are and we listened. If you want to get your Big Data work done more quickly and easily, consider adding a new tool to your toolbox.

For more information about BMC big data solutions, visit https://www.bmc.com/it-solutions/big-data.html

EMA Radar™ Report for Workload Automation and Orchestration 2023

To stay agile and innovative while ensuring reliability, businesses need to be able to orchestrate application and data workflows easily from development through to production. According to EMA, Control-M delivers more value than any other Workload Automation (WLA) solution on the market—helping IT elevate the business impact of this core discipline.


These postings are my own and do not necessarily represent BMC's position, strategies, or opinion.

See an error or have a suggestion? Please let us know by emailing blogs@bmc.com.

BMC Brings the A-Game

BMC works with 86% of the Forbes Global 50 and customers and partners around the world to create their future. With our history of innovation, industry-leading automation, operations, and service management solutions, combined with unmatched flexibility, we help organizations free up time and space to become an Autonomous Digital Enterprise that conquers the opportunities ahead.
Learn more about BMC ›

About the author

Basil Faruqui

Basil joined BMC in 2003 and has worked in several different technical and management roles within Control-M and Remedy product lines. He is currently working as a Principal Solutions Marketing manager for Control-M where his areas of focus include DevOps, big data and cloud. Basil has an MBA in Marketing and a BBA in Management Information Systems from the University of Houston. He has more than 15 years’ experience in technology that spans Software Development, Customer Support, Marketing, Business Planning and Knowledge Management.