Operational Database Management

This blog post is part of a series on Cloudera’s Operational Database (OpDB) in CDP. Each post goes into more detail about new features and capabilities. Start from the beginning of the series with Operational Database in CDP. This post gives an overview of the OpDB management tools and features in the Cloudera Data Platform. The tools discussed here will help you understand the options available for managing the operations of your OpDB cluster.

Challenges of running a big data distro in the cloud

There are many reasons to run a big data distribution, such as Cloudera Data Hub (CDH) or Hortonworks Data Platform (HDP), in the cloud with Infrastructure-as-a-Service (IaaS). The main reason is agility. When the business needs to onboard a new use case, a data admin can add virtual infrastructure to their cloud clusters in minutes or hours. With an on-prem cluster, it may take weeks or months to add the infrastructure capacity for a new use case.

Allegro Trains trains-agent installation tutorial

Installation and configuration tutorial for Trains-Agent, Allegro AI's zero-configuration, fire-and-forget execution agent for the Allegro Trains solution. Trains-Agent enables ML-Ops / DevOps orchestration, queue management, remote execution, automation and more for Allegro Trains. Allegro Trains is an open-source machine learning and deep learning (ML / DL) experiment manager, versioning and ML-Ops solution for data science and data engineering teams and projects.

Predicting the Future With Linear Regression in Ruby

The world is full of linear relationships. When one apple costs $1 and two apples cost $2, it's easy to figure out the price of any number of apples. But what happens when you have hundreds of data points? What if your data source is noisy? That's when it's helpful to use a technique called linear regression. In this article, Julie Kent shows us how linear regression works and walks through a practical example in Ruby.
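
To make the idea concrete, here is a minimal sketch of an ordinary least-squares fit in plain Ruby. It is not the code from Julie Kent's article; the method names `linear_fit` and `predict` and the noisy apple prices are made up for illustration.

```ruby
# Ordinary least-squares fit of y = slope * x + intercept.
# Hypothetical helper, not taken from the article.
def linear_fit(xs, ys)
  raise ArgumentError, "need two or more matching points" if xs.size != ys.size || xs.size < 2

  n = xs.size.to_f
  mean_x = xs.sum / n
  mean_y = ys.sum / n

  # Unnormalised covariance of x and y, and unnormalised variance of x.
  sxy = xs.zip(ys).sum { |x, y| (x - mean_x) * (y - mean_y) }
  sxx = xs.sum { |x| (x - mean_x)**2 }
  raise ArgumentError, "x values must not all be equal" if sxx.zero?

  slope = sxy / sxx
  intercept = mean_y - slope * mean_x
  [slope, intercept]
end

# Predict y for a new x using a fitted line.
def predict(x, slope, intercept)
  slope * x + intercept
end

# Illustrative, invented data: noisy prices for 1 to 5 apples.
quantities = [1, 2, 3, 4, 5]
prices = [1.1, 1.9, 3.2, 3.9, 5.1]

slope, intercept = linear_fit(quantities, prices)
puts format("price ~ %.2f * quantity + %.2f", slope, intercept)
puts format("estimated price of 10 apples: %.2f", predict(10, slope, intercept))
```

The fit minimises the sum of squared vertical distances between the points and the line, which is why a few noisy measurements do not throw the estimate off badly.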

Evolving Insurance with Data and Analytics

Insurance companies around the world are striving ahead with innovative offerings that are fundamentally changing the insurance landscape. They are creating personalized offerings and products tailored to the specific needs of their customers. For example, they are implementing usage-based insurance (UBI) based on driving habits, miles driven and driving history, and offering discounts on health insurance based on data from health trackers.

Qlik Data Analytics - April 2020 Feature Demonstration

A longer, more detailed demonstration of features available in the Qlik Sense April 2020 release. Note that an attempt was made to create an index with time-code URLs, but for some reason clicking a time-code link in the description just brings you to the start of the video. Summary Video: Demo App.

The U.S. Census Enters the Digital Age with Cloudera

2020 brings a new decade, and for the U.S. Census Bureau, a new challenge. The Bureau is the leading provider of demographic and economic data for the federal government and the nation, and its largest initiative is the U.S. Census, which is conducted every 10 years and counts every resident in the United States. For the first time in U.S. history, the census will be conducted primarily online instead of by mail.

Now Is the Time to Take Stock in Your Dataops Readiness: Are Your Systems Ready?

As the global business climate experiences rapid change due to the health crisis, the role of data in providing much-needed solutions to urgent issues is being highlighted throughout the world. Having helped customers manage critical modern data systems for years, Unravel sees a heightened interest in fortifying the reliability of business operations in healthcare, logistics, financial services and telecommunications.