Commit Graph

6 Commits

Author SHA1 Message Date
Joshua Salako 9c429caa56 Implement streaming pipeline endpoint for batch processing
- Added `/run/streaming-pipeline` endpoint to process data in batches from either a database or CSV file.
- Introduced `BatchResponse` model for structured responses.
- Updated README with new endpoint details, including parameters and example usage.
- Enhanced error handling and logging during batch processing.
- Ensured data preprocessing and NaN handling in analysis functions.
2025-05-02 14:25:31 +01:00
Joshua Salako 5767f55686 Update project structure and enhance model persistence
- Added new model and scaler files to .gitignore and output directory.
- Updated Dockerfile to create output/models directory.
- Revised README to include instructions for using a .env file for configuration.
- Enhanced config.py to load database credentials from environment variables.
- Implemented model saving functionality in salary_predictor.py for consistent and inconsistent earners.
2025-05-02 00:16:46 +01:00
Joshua Salako 8acfb436f3 Enhance API with data loading functionality and update README.
- Added `/load-data` endpoint to load transaction data from either a database or a CSV file.
- Updated `SalaryAnalyticsPipeline` and `DataLoader` to support loading from CSV.
- Implemented data validation and error handling for loading processes.
- Revised README to include new data loading instructions and workflow steps.
- Added checks to ensure data is loaded before running analysis endpoints.
2025-05-01 22:57:55 +01:00
Joshua Salako 7e7094f0fd Remove salary.py file, eliminating all salary transaction analysis and related functions. 2025-04-28 19:45:19 +01:00
Joshua Salako 591d4611b6 Added new salary-related terms and improved image outputs in salary.ipynb 2025-04-28 19:44:40 +01:00
Joshua Salako 8207d8f1ff first commit 2025-04-25 00:01:38 +01:00