The Clear SHOW - S02E07 - Manual Orchestration (Pit Stop!)

Before we write the super-easy automation for our feature-store workflow, we have to make sure we all understand how to run a task on a clearml-agent! Join T. Guerre for a quick demo of what ClearML can do for manual orchestration of workflows, once you have used it to manage your experiments! ClearML - Your entire workflow in one MLOps platform

Jeeves Grows Up: How an AI Chatbot Became Part of Unravel Data

Jeeves is the stereotypical English butler – and an AI chatbot that answers pertinent and important questions about Spark jobs in production. Shivnath Babu, CTO and co-founder of Unravel Data, spoke yesterday at Data + AI Summit, formerly known as Spark Summit, about the evolution of Jeeves, and how the technology has become a key supporting pillar within Unravel Data’s software.

DataStore vs FeatureStore

I think it’s safe to say that one of the worst things in Machine Learning is the terminology. The maths and statistics are definitely part of the learning curve, but more than that, it feels like you are learning a new language. In some ways, you are. DataStore and FeatureStore are two of the current buzzwords that people are trying to understand. To be fair, DataStore and FeatureStore feel like family rather than strangers.

What is Data as a Service (DaaS)?

As the amount of data companies are faced with snowballs, the need for efficient data governance grows. An increasing number of organizations are turning to cloud service providers for data management. In this context, data as a service, often referred to as DaaS, is becoming an essential tool for managing data integration, data storage, and data analytics.

What is File Transfer Protocol?

Transferring files between two or more machines is an essential part of the ETL (extract, transform, load) process. Of course, there are multiple ways to move data, including flat file databases. For example, you can physically copy the data onto a USB drive or send it to the recipient via email. But methods like these are far less efficient than sending data via FTP. So what is FTP exactly, and how do you use it to transfer files and data? Keep reading for all the answers.

The Complete Guide to GDPR Compliance

The General Data Protection Regulation (GDPR) is a landmark piece of legislation that affects how organizations can handle, process, and store the personal data of European Union (EU) citizens and residents. But what does the GDPR require exactly, and how can you be sure that your organization complies with it? We go over everything you need to know in this all-in-one guide to GDPR compliance.

Say Goodbye to Data Quality with ELT

ELT is a three-step process that first extracts raw, structured, and unstructured data from source databases, applications, data stores, and other repositories. It then loads that data into a data lake and transforms it as needed by analysts. Since it doesn't move the data to an intermediate staging area or transform it before loading, the extraction process is speedy. You don’t need to pick and choose what data loads into the data lake or wait for it to be processed.