Overview:
CloudQuery is an open-source cloud asset inventory designed for platform teams, DevOps engineers, and security professionals. It syncs metadata from cloud infrastructure (AWS, Azure, GCP) and over 70 other cloud and SaaS sources—including Wiz, Finout, and GitHub—directly into a user-managed data warehouse. The project aims to eliminate the need for scripting against disparate APIs by making cloud asset data consistently queryable with SQL, enabling teams to run analysis, enforce security policies, and manage costs from a single, normalized dataset.
Core Features:
Plugin-based source coverage: Offers specialized plugins for cloud infrastructure (AWS, GCP, Azure), security (Wiz), and FinOps (Finout), with built-in normalization, rate limit handling, and more.
Normalized SQL-queryable data: Transforms raw cloud metadata into structured, unified data accessible via standard SQL queries, removing the need to manage multiple poorly-documented APIs.
Composable architecture and high-performance syncs: Uses Apache Arrow to move large volumes of data quickly with fine-grained control; supports custom orchestrators, multiple output destinations, and a plugin system that allows extensions in multiple languages.
Self-hosted deployment: Runs entirely on the user’s own infrastructure, ensuring cloud asset data never touches CloudQuery’s servers.
Integration with external tools: Supports connections to BI tools, Slack, Jira, and other systems, turning cloud asset data into actionable signals within existing workflows.
Code-first extensibility: Open plugin system and multi-language SDK allow developers to write custom sources or destinations without vendor lock-in.
Use Cases:
Cloud asset inventory: Collect and unify configuration metadata from AWS, GCP, Azure, and dozens of other cloud providers for a single-pane-of-glass view.
Cloud Security Posture Management (CSPM): Use the normalized dataset to monitor and enforce security policies across multi-cloud environments.
Cloud FinOps: Aggregate billing data from cloud providers into one warehouse to analyze and reduce cloud spending.
Why It Matters:
CloudQuery provides a self-hosted, code-first approach to unifying cloud asset data that avoids dependence on centralized SaaS inventory tools. Because it runs on the user’s own infrastructure, sensitive metadata stays within their control. Its plugin system, Apache Arrow-powered syncs, and SQL-based querying give platform teams the flexibility to fit cloud asset inventory directly into their existing data stack and automation workflows, rather than adapting to a pre-built dashboard.




