Data Warehouse and Filestore

Admins will need to set up the warehouse and filestore before Kleene can be used

Warehouse and Filestore Overview

Purpose of the Warehouse

The Warehouse in serves as the central repository for all data ingested into the platform. It stores raw, cleaned, and transformed data in a structured format, making it accessible for querying, reporting, and analysis. The Warehouse supports efficient data management, enabling users to:

  • Ingest Data: Collect data from various sources, including APIs, databases, and manual uploads.
  • Transform Data: Apply transformations to raw data to prepare it for analysis.
  • Store Data: Maintain historical data for long-term analysis and compliance.

Purpose of the Filestore

The Filestore in (sometimes referred to as the data lake) is designed to handle unstructured and semi-structured data and be a landing area for all data before it gets loaded to the warehouse. The Filestore enables users to:

  • Store Large Files: Accommodate large datasets, logs, images, and other file types not suitable for the Warehouse.
  • Backup Data: Provide a secure location for data backups and archives, which is an easy way to recover any lost Warehouse data from.
  • Improve Data Accessibility: Ensure easy access to unstructured data for further processing and analysis.
  • Alternate Compute Profiles: Allow teams to query raw data with compute outside the Data Warehouse

Integration and Workflow

Data Ingestion: Data is first ingested into the Filestore, then moved to the Warehouse to be structured and transformed.
Data Transformation: Structured data in the Warehouse is transformed using SQL or other tools provided by
Data Utilization: Transformed data is used for generating insights, reporting, and advanced analytics.

By combining the capabilities of both the Warehouse and the Filestore, provides a comprehensive data management solution that supports diverse data types and complex analytical needs.

What’s Next