jobTakt: German Job Market Intelligence
A data engineering project processing 18K+ jobs daily.
Data Insight
View InsightsA Glimpse of the Pipeline
Data flows from multiple sources, is processed and stored, then served to the frontend.
Core Project Features
This project demonstrates a wide range of data engineering principles and practices.
All job data is stored in a single, unified PostgreSQL database, acting as the single source of truth.
The data pipeline runs automatically multiple times a day, ensuring the job board is always up-to-date.
Jobs from various sources are transformed and loaded into a standardized format for consistency.
The frontend allows users to filter jobs by location, keywords, and other criteria in real-time.
The architecture is built to handle a growing number of data sources and increasing data volume with ease.
The system logs recent activities and key metrics, providing a clear view of the pipeline's health and performance.
Recent Pipeline Activity
A live look at the most recent data harvesting and processing events.
Scraped Adesso: Found 1754 jobs (648 new).
2 minutes ago
Scraped Metro AG: Found 28 jobs (0 new).
4 minutes ago
Scraped Solaris: Found 14 jobs (0 new).
4 minutes ago
Scraped Clark: Found 6 jobs (0 new).
4 minutes ago
Scraped Bundesagentur für Arbeit: Found 0 jobs (0 new).
4 minutes ago