jobTakt: German Job Market Intelligence

A data engineering project processing 18K+ jobs daily.

Data Insight

View Insights

A Glimpse of the Pipeline

Data flows from multiple sources, is processed and stored, then served to the frontend.

Data Sources
Smart Scripts
Validation
Database
API
Web UI
Click to explore the full architecture

Core Project Features

This project demonstrates a wide range of data engineering principles and practices.

Centralized Database

All job data is stored in a single, unified PostgreSQL database, acting as the single source of truth.

Automated Refresh

The data pipeline runs automatically multiple times a day, ensuring the job board is always up-to-date.

Data Unification

Jobs from various sources are transformed and loaded into a standardized format for consistency.

Dynamic Filtering

The frontend allows users to filter jobs by location, keywords, and other criteria in real-time.

Scalable Design

The architecture is built to handle a growing number of data sources and increasing data volume with ease.

Pipeline Monitoring

The system logs recent activities and key metrics, providing a clear view of the pipeline's health and performance.

Recent Pipeline Activity

A live look at the most recent data harvesting and processing events.

  • Scraped Adesso: Found 1754 jobs (648 new).

    2 minutes ago

  • Scraped Metro AG: Found 28 jobs (0 new).

    4 minutes ago

  • Scraped Solaris: Found 14 jobs (0 new).

    4 minutes ago

  • Scraped Clark: Found 6 jobs (0 new).

    4 minutes ago

  • Scraped Bundesagentur für Arbeit: Found 0 jobs (0 new).

    4 minutes ago