LogoLogo
Kadeck DocumentationSubmit Ticket
  • Platform overview
  • Deployment
  • API Reference
  • Platform updates
  • Introduction
  • Context & Scope
  • Interoperability & Network
  • System Requirements
  • Configuration List
  • Deployments
    • Overview
    • Basic Setup
    • Advanced Setup
    • Automated Deployment
    • Identity Provider Integration
    • License Activation Scenarios
  • FAQs
    • How do I create a certificate and keystore?
  • How do I access the container images?
  • How to make container images offline available?
  • How do I configure the memory for Portal?

Legal

  • Legal Notice
  • Privacy Policy

© 2025 Xeotek Inc. and its affiliates

On this page
  • Components
  • Portal
  • Argus
  • Machina
  • Context

Was this helpful?

Export as PDF

Context & Scope

PreviousIntroductionNextInteroperability & Network

Last updated 1 month ago

Was this helpful?

The Data Streamhouse is architected as a modular system, with each component responsible for a distinct operational concern. This separation enables horizontal scalability, fault isolation, and performance optimization—allowing the platform to support environments with hundreds of Kafka clusters and millions of active data streams.

Components

Portal

Portal serves as the central control plane for distributed real-time data systems. It connects to Apache Kafka clusters and related components to unify access, governance, and operations across teams and environments. Users interact with Portal through a web-based interface.

Argus

Argus monitors one or more Apache Kafka clusters. It collects metrics, performs system health checks, and runs the Health Assistant to detect issues across the cluster and its associated components. Argus operates headlessly and reports back to Portal. Cluster assignments are managed via Portal.

Machina

Machina is the computational core of the Data Streamhouse. It executes data streaming applications and processes, such as data integration, processing, or custom analytics. Machina instances are deployed and managed independently for workload isolation and scale.

Context

The Data Streamhouse runs entirely within your controlled infrastructure—whether in private cloud, on-premise data centers, or containerized environments on public cloud platforms. The diagram below outlines the core components of the system (Portal, Machina, Argus) and the external systems and actors that interface with it.

A single Data Streamhouse typically connects to multiple, separate data streaming environments—each with its own Kafka clusters, Schema Registries, Connect clusters, and associated applications. This multi-environment architecture supports hybrid setups and allows organizations to centralize governance, monitoring, and control across all streaming domains from one unified platform.

Key external integrations include:

  • Authentication providers (e.g., LDAP, OpenID)

  • Automation and provisioning systems (IaC, CI/CD)

  • Alerting and messaging platforms

  • Public APIs for integration with internal tooling and workflows