
Leveraging BigQuery Audit Log pipelines for Usage Analytics

In the BigQuery Spotlight series, we talked about Monitoring. This post focuses on using Audit Logs for deep-dive monitoring. BigQuery Audit Logs are a collection of logs from Google Cloud that give insight into operations related to your use of BigQuery. A wealth of information is available to you in the Audit Logs. Cloud Logging captures events that show “who” performed “what” activity and “how” the system behaved.

Is SSIS a Good ETL Tool?

ETL (Extract, Transform, and Load) is a well-known data integration process. There is an overwhelming number of tools that you can use (one of which is SSIS), and it can be difficult to choose between them. What exactly is SSIS, and how can it help your company perform ETL better than you ever have before? This article will explain the major features of SSIS, demonstrate the pros and cons of implementing it, and advise on when you might be better off with a different ETL tool.

Modern Data Stack using Integrate.io for the ELT

Integrate.io is a company that provides an ELT (Extract, Load, and Transform) data stack. They can do transformations using dbt, which stands for data build tool. Then they use Integrate.io again to push the data into systems like Salesforce. This setup gives you better control over your data and provides a cost-effective solution.

Data Goes Around The World In 80 Seconds With Snowflake

See how a database named Phileas Fogg can journey around the world in 80 seconds on Snowflake in this animated short. With Snowflake, PHILEAS_FOGG can fail over in the event of a disruption to enable continuous business operations, and it can be joined with local data sets for global data collaboration across clouds.

Adopting a Production-First Approach to Enterprise AI

After a year packed with one machine learning and data science event after another, it’s clear that there are a few different definitions of the term ‘MLOps’ floating around. One convention uses MLOps to mean the cycle of training an AI model: preparing the data, training the model, and evaluating it. This iterative or interactive cycle often includes AutoML capabilities, and what happens outside the scope of the trained model is not included in this definition.

PostgreSQL to Amazon Redshift: 4 Ways to Replicate Your Data

PostgreSQL is the preferred platform of millions of developers around the world. The open-source tool is one of the most powerful databases on the planet, with the ability to handle sophisticated analytical workloads and high levels of concurrency. That makes PostgreSQL (also called Postgres) a popular DB for scientific research and AI/ML projects. It’s also a popular production database for data-driven companies in every industry. But no database is perfect.