search
Clients cases Google Cloud – Client Case: MacPaw

About the company

MacPaw is a product-driven IT company that creates innovative software designed to make Mac users more productive. Millions of people worldwide use MacPaw products, including CleanMyMac, CleanMy®Phone, Setapp, and ClearVPN. The company was founded by Oleksandr Kosovan in 2008.

Today, MacPaw focuses on expanding the capabilities of users, teams, and developers through artificial intelligence. Eney by MacPaw is an AI assistant for Mac that helps users automate tasks, interact with their devices, and move toward the Software 3.0 era.
Cybersecurity is another key area for MacPaw. The company researches emerging types of malicious software and delivers reliable protection through Moonlock, a new cybersecurity application for Mac users.

Project Start Date: May 30, 2020
Project End Date: Collaboration Continues

Country

Ukraine

Industry

IT

Software

Technology

Tech Stack

Google Cloud Platform

BigQuery

Google Cloud Storage

Google Kubernetes Engine

Cloud Composer

Apache Spark

Pub/Sub

Looker Studio

Vertex AI

Stats

10x faster query performance thanks to BigQuery’s serverless architecture

3x reduction in costs for analytics support through BigQuery’s on-demand pricing model

Dozens of datasets with access controls via IAM and Authorized Views

BigQuery in Practice: How MacPaw Achieved 10x Faster Analytics and Built a Unified DataHub

 

The Challenge: The Need for Fast Analytics

MacPaw processes large volumes of data every day, including user behavior, marketing performance, and analysis of new product releases. This work is a daily responsibility for business analysts and often requires significant time and resources.

Initially, the company relied on Google Analytics and small in-house solutions. It soon became clear that this approach was insufficient. Google Analytics limited access to raw data due to sampling, and combining its metrics with other sources—ad platforms, payment systems, internal products, and backend services—was not possible.

To gain a complete and manageable data set, the team built an analytics platform on Redshift. Over time, Redshift no longer met requirements for speed and flexibility, which led to the search for a new technical foundation. As a result, the team chose Google BigQuery—a decision that became the backbone of a new DataHub.

BigQuery as the Foundation of DataHub

Once the team migrated to BigQuery, it became clear that this was more than just another data warehouse. It was a different way of thinking about analytics.

Today, the team uses BigQuery as the core platform for data storage and processing and has built its own internal product—DataHub. It aggregates data from MacPaw websites and additional systems and is actively used for analytics and reporting. This is where everyone—from analysts to marketers—finds answers.

Why BigQuery?

The MacPaw team highlighted several decisive advantages that made BigQuery the right choice:

  • Fast data processing that enables complex calculations with results in seconds rather than hours.
  • A high level of security, essential for storing business-critical data.
  • Usage-based pricing tied to the volume of processed data, suitable for both large and small businesses.
  • An easy-to-use interface that lets analysts query data without writing code or relying on engineers.
  • Built-in ML capabilities that enable the use of Google Cloud machine learning and AI services.
  • Data visualization in Looker Studio (formerly Google Data Studio).

Since then, the company has significantly expanded its Google Cloud architecture, increased data volumes, and launched its first AI pipelines. Below is how MacPaw’s data ecosystem has evolved.

“With Google BigQuery, even minimal computer resources are enough to process any volume of data. All calculations happen on Google’s servers, while your device simply sends queries and receives results. It’s faster and more straightforward.”
Dmitry Osiyuk Lead Analyst MacPaw

How It Works Today: Current Architecture and Scale

Since 2020, MacPaw’s use of Google Cloud for data workloads has grown more than threefold. This growth resulted from a shift to in-house analytics solutions and ecosystem expansion.

Today, MacPaw operates a flexible Google Cloud architecture that covers every key stage of the data lifecycle and is built around BigQuery as the central core. Processing and automation run on Google Kubernetes Engine (GKE) using Airflow and Spark.

Pub/Sub is used for event ingestion, while all data—from raw to processed—is stored in Google Cloud Storage (GCS) in Delta Lake format. BigQuery serves as the primary data warehouse, combining managed tables with BigLake to analyze data directly from GCS.

Results are visualized in Looker Studio, and the Data Science team trains and deploys models on Vertex AI.

A Unified Ecosystem for the Business

This architecture allows the team to launch new products quickly and test ideas without worrying about resource constraints. All data lives in a shared repository with clearly defined access controls via IAM. Dozens of datasets are no longer a challenge—they are the norm.

As a result, MacPaw gained a single platform that provides:

  • a stable infrastructure for product operations;
  • fast and secure access to analytics;
  • the ability to scale AI initiatives without investing in on-prem infrastructure.

 

MacPaw team

The Evolution of BigQuery: From Initial Datasets to Dozens

MacPaw considers increased team productivity and efficiency when working with large data volumes to be the main business benefit. BigQuery reduces wait times for calculations and results, which directly speeds up analytics and overall company operations.

The platform scales easily and is flexible to manage, simplifying data integration for data engineers and allowing the company to host growing volumes of data without purchasing additional servers.

  • At the start, DataHub worked with 1–2 datasets.
  • Today—dozens of datasets with access controls via IAM and Authorized Views.
  • Used BigQuery streaming ingestion for near-real-time analytics.
  • Added BigLake for SQL queries directly on data stored in GCS.
“Thanks to the virtually unlimited resources of BigQuery, we can always analyze the required amount of data, knowing capacity is no longer a concern. This is critical during release and update periods, when many stakeholders need insights quickly. BigQuery is the first place analysts look—and the source for dashboards used by managers at every level.”
Oleksii Sopov Lead Data Engineer, MacPaw

Vertex AI: When Analytics Turn Into Predictions

The next step was moving beyond analysis to building predictions. To achieve this, MacPaw integrated Vertex AI.

Vertex AI serves as the foundation for machine learning at MacPaw. It is integrated with the entire Google Cloud ecosystem, helping teams run experiments quickly and reuse training pipelines across projects , including work with large language models and deep learning—without additional infrastructure.

The outcome: faster model development cycles, scalable AI projects, and transparent cost control.

Business Impact

MacPaw built a unified data ecosystem where every data-driven decision starts in BigQuery and ends in Looker Studio dashboards. The company optimized costs while enabling secure data access at scale and reducing time-to-insight as the company expands.

Key Results:

10x faster query performance thanks to BigQuery’s serverless architecture, which instantly provides the necessary resources for computations of any complexity.
3x reduction in costs for analytics support through BigQuery’s on-demand pricing model.
Efficient processing of terabytes of data daily by the analytics and data engineering team, as BigQuery completely handles infrastructure administration.
Centralization of 90%+ of data in BigQuery and GCS, ensuring a single source of truth and reliability for the entire company.
Looker Studio provides real insights for marketing, product, and leadership by working directly with data in BigQuery.
Vertex AI enabled the shift from simple analytics to creating predictive models, laying a solid foundation for future AI use cases in MacPaw products.
Expand your analytics capabilities, cut unnecessary costs, and speed up AI adoption with Cloudfresh Let's Connect

What’s Next: AI as the Next Priority

MacPaw continues to expand its AI initiatives, focusing on Vertex AI and Google Cloud services that allow teams to launch predictive models quickly, experiment freely, and control infrastructure costs.

“I believe that AI is an area that should be strengthened through cloud technologies. AI and Cloud are now almost inseparable, and I would like to see more tools and technologies.”
Oleksii Sopov Lead Data Engineer, MacPaw

MacPaw Cloudfresh Office

Cloudfresh as a Trusted Partner for MacPaw

To ensure everything runs reliably, MacPaw needed more than just the right technology—they needed a team to advise, support, and step in when needed. That partner is Cloudfresh. As a Google Cloud Premier Partner, Cloudfresh provided end-to-end consulting on BigQuery and helped integrate it into MacPaw’s ecosystem. Google Cloud experts at Cloudfresh continue to support MacPaw on technical matters.

What did Cloudfresh’s role include?

  1. Data architecture consulting: optimizing BigQuery dataset structures, IAM setup, security, and scalability.
  2. Rapid support in critical situations: assisting with service recovery and minimizing downtime risks.
  3. Direct communication with Google: acting as a reliable intermediary to remove blockers through direct vendor engagement.
  4. AI and ML expertise: advising on the use of Vertex AI to accelerate model deployment and scaling.

Our partnership with Cloudfresh continues — we are still here to strengthen the MacPaw team with expertise, support, and new Google Cloud capabilities.

Get in touch with Сloudfresh