feat(job_crawler): initialize job crawler service with kafka integration

- Add technical documentation (技术方案.md) with system architecture and design details
- Create FastAPI application structure with modular organization (api, core, models, services, utils)
- Implement job data crawler service with incremental collection from third-party API
- Add Kafka service integration with Docker Compose configuration for message queue
- Create data models for job listings, progress tracking, and API responses
- Implement REST API endpoints for data consumption (/consume, /status) and task management
- Add progress persistence layer using SQLite for tracking collection offsets
- Implement date filtering logic to extract data published within 7 days
- Create API client service for third-party data source integration
- Add configuration management with environment-based settings
- Include Docker support with Dockerfile and docker-compose.yml for containerized deployment
- Add logging configuration and utility functions for date parsing
- Include requirements.txt with all Python dependencies and README documentation

This commit is contained in:

李顺东

2026-01-15 17:09:43 +08:00

commit ae681575b9

26 changed files with 1898 additions and 0 deletions

									
										4

job_crawler/app/utils/__init__.py
									
										Normal file
									
												View File
												
				@@ -0,0 +1,4 @@

				"""工具模块"""

				from .date_parser import parse_aae397, parse_collect_time, is_within_days

				__all__ = ["parse_aae397", "parse_collect_time", "is_within_days"]

feat(job_crawler): initialize job crawler service with kafka integration

4 job_crawler/app/utils/__init__.py Normal file Unescape Escape View File

4

job_crawler/app/utils/init.py Normal file

View File