AI-Trader

mirror of https://github.com/Xe138/AI-Trader.git synced 2026-04-02 09:37:23 -04:00

Author	SHA1	Message	Date
Bill	fb32bb12c5	fix: check column existence before creating indexes Fixes startup error 'no such column: session_id' that occurs when _create_indexes() tries to create indexes on columns that don't exist yet. The issue occurred when initializing a database from scratch: 1. _migrate_schema() adds session_id column to positions table 2. _create_indexes() tries to create index on session_id 3. But on fresh databases, positions table was created without session_id 4. Migration runs after table creation, before index creation 5. Index creation fails because column doesn't exist yet Solution: Check if columns exist before creating indexes on them. This ensures the database can be initialized both: - Fresh (CREATE TABLE without session_id, then ALTER TABLE, then CREATE INDEX) - Migrated (ALTER TABLE adds column, then CREATE INDEX) Tested: All 21 database tests passing	2025-11-02 19:24:19 -05:00
Bill	f104164187	feat: implement reasoning logs API with database-only storage Complete implementation of reasoning logs retrieval system that replaces JSONL file-based logging with database-only storage. Database Changes: - Add trading_sessions table (one record per model-day) - Add reasoning_logs table (conversation history with summaries) - Add session_id column to positions table - Add indexes for query performance Agent Changes: - Add conversation history tracking to BaseAgent - Add AI-powered summary generation using same model - Remove JSONL logging code (_log_message, _setup_logging) - Preserve in-memory conversation tracking ModelDayExecutor Changes: - Create trading session at start of execution - Store reasoning logs with AI-generated summaries - Update session summary after completion - Link positions to sessions via session_id API Changes: - Add GET /reasoning endpoint with filters (job_id, date, model) - Support include_full_conversation parameter - Return both summaries and full conversation on demand - Include deployment mode info in responses Documentation: - Add complete API reference for GET /reasoning - Add design document with architecture details - Add implementation guide with step-by-step tasks - Update Python and TypeScript client examples Testing: - Add 6 tests for conversation history tracking - Add 4 tests for summary generation - Add 5 tests for model_day_executor integration - Add 8 tests for GET /reasoning endpoint - Add 9 integration tests for E2E flow - Update existing tests for schema changes All 32 new feature tests passing. Total: 285 tests passing.	2025-11-02 18:31:02 -05:00
Bill	0098ab5501	feat: add GET /reasoning API endpoint - Add Pydantic models for reasoning API responses - Implement GET /reasoning with job_id, date, model filters - Support include_full_conversation parameter - Add comprehensive unit tests (8 tests) - Return deployment mode info in responses 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-02 18:08:39 -05:00
Bill	555f0e7b66	feat: store reasoning logs with sessions in model_day_executor - Add _create_trading_session() method to create session records - Add async _store_reasoning_logs() to store conversation with AI summaries - Add async _update_session_summary() to generate overall session summary - Modify execute() -> execute_async() with async workflow - Add execute_sync() wrapper and keep execute() as sync entry point - Update _write_results_to_db() to accept and use session_id parameter - Modify positions INSERT to include session_id foreign key - Remove old reasoning_logs code block (obsolete schema) - Add comprehensive unit tests for all new functionality All tests pass. Session-based reasoning storage now integrated.	2025-11-02 18:03:41 -05:00
Bill	396a2747d3	fix: resolve async execution and position storage issues - Fix async call in model_day_executor.py by wrapping with asyncio.run() Resolves RuntimeWarning where run_trading_session coroutine was never awaited - Remove register_agent() call in API mode to prevent file-based position storage Position data is now stored exclusively in SQLite database (jobs.db) - Update test mocks to use AsyncMock for async run_trading_session method This fixes production deployment issues: 1. Trading sessions now execute properly (async bug) 2. No position files created, database-only storage 3. All tests pass Closes issue with no trades being executed in production	2025-11-02 17:16:55 -05:00
Bill	71ec53db45	fix: read merged runtime config in containerized mode The FastAPI app now checks for /tmp/runtime_config.json (created by entrypoint.sh config merger) before falling back to the default config. This allows user-provided configs mounted at /app/user-configs/ to properly override default values in containerized deployments. Fixes issue where custom user configs were not being applied.	2025-11-02 16:16:58 -05:00
Bill	b5f18ac0f3	chore: remove diagnostic logging code Cleaned up all diagnostic print statements added during debugging. The root cause (non-idempotent get_db_path) has been fixed, so the extensive instrumentation is no longer needed. Changes: - Removed all diagnostic prints from api/main.py (lifespan and module-level) - Removed all diagnostic prints from api/database.py (get_db_connection and initialize_dev_database) - Kept essential user-facing messages (PRESERVE_DEV_DATA notice, database creation messages) All 28 integration tests pass.	2025-11-02 16:00:34 -05:00
Bill	6e4b2a4cc5	debug: add connection-level diagnostics to trace database access Enhanced diagnostics to trace database path resolution and table existence at connection time. This will help identify if get_db_connection() is resolving paths correctly and accessing the right database file. Added diagnostics to: - get_db_connection(): Show input path, resolved path, file existence, and tables found - initialize_dev_database(): Verify tables exist after creation This will reveal whether the path resolution is working correctly or if there's a timing/caching issue with database file access.	2025-11-02 15:46:36 -05:00
Bill	18bd4d169d	debug: add comprehensive diagnostic logging for database initialization Following systematic debugging methodology after 5 failed fix attempts. Adding extensive print-based diagnostics to trace execution flow in Docker. Instrumentation added to: - api/main.py: Module import, app creation, lifespan function, module-level init - api/database.py: initialize_dev_database() entry/exit and decision points This diagnostic version will help identify: 1. Whether module-level code executes in Docker 2. Which initialization layer is failing 3. Database paths being resolved 4. Environment variable values Tests confirmed passing with diagnostic logging.	2025-11-02 15:41:47 -05:00
Bill	8b91c75b32	fix: add module-level database initialization for uvicorn reliability Add database initialization at module load time to ensure it runs regardless of how uvicorn handles the lifespan context manager. Issue: The lifespan function wasn't being triggered consistently when uvicorn loads the app module, causing "no such table: jobs" errors. Solution: Initialize database when the module is imported (after app creation), providing a reliable fallback that works in all deployment scenarios. This provides defense-in-depth: 1. Lifespan function (ideal path) 2. Module-level initialization (fallback/guarantee) Both paths check deployment mode and call the appropriate init function.	2025-11-02 15:36:12 -05:00
Bill	bdb3f6a6a2	refactor: move database initialization from entrypoint to application Move database initialization logic from shell script to Python application lifespan, following separation of concerns and improving maintainability. Benefits: - Single source of truth for database initialization (api/main.py lifespan) - Better testability - Python code vs shell scripts - Clearer logging with structured messages - Easier to debug and maintain - Infrastructure (entrypoint.sh) focuses on service orchestration - Application (api/main.py) owns its data layer Changes: - Removed database init from entrypoint.sh - Enhanced lifespan function with detailed logging - Simplified entrypoint script (now 4 steps instead of 5) - All tests pass (28/28 API endpoint tests)	2025-11-02 15:32:53 -05:00
Bill	68d9f241e1	fix: use closure to capture db_path in lifespan context manager - Fix lifespan function to access db_path from create_app scope via closure - Prevents "no such table: jobs" error by ensuring database initialization runs - Previous version tried to access app.state.db_path before it was set The issue was that app.state is set after FastAPI instantiation, but the lifespan function needs the db_path during startup. Using closure allows the lifespan function to capture db_path from the create_app function scope.	2025-11-02 15:24:29 -05:00
Bill	4fec5826bb	fix: initialize dev database on API startup to prevent stale job blocking - Add database initialization to API lifespan event handler - DEV mode: Reset database on startup (unless PRESERVE_DEV_DATA=true) - PROD mode: Ensure database schema exists - Migrate from deprecated @app.on_event to modern lifespan context manager - Fixes 400 error "Another simulation job is already running" on fresh container starts This ensures the dev database is reset when the API server starts in dev mode, preventing stale "running" or "pending" jobs from blocking new job creation.	2025-11-02 15:20:51 -05:00
Bill	68aaa013b0	fix: handle 'skipped' status in job_detail_status updates - Add 'skipped' to terminal states in update_job_detail_status() - Ensures skipped dates properly: - Update status and completed_at timestamp - Store skip reason in error field - Trigger job completion checks - Add comprehensive test suite (11 tests) covering: - Database schema validation - Job completion with skipped dates - Progress tracking with skip counts - Multi-model skip handling - Skip reason storage Bug was discovered via TDD - created tests first, which revealed that skipped status wasn't being handled in the terminal state block at line 397. All 11 tests passing.	2025-11-02 09:49:50 -05:00
Bill	1f41e9d7ca	feat: add skip status tracking for job orchestration Implement skip status tracking to fix jobs hanging when dates are filtered out. Jobs now properly complete when all model-days reach terminal states (completed/failed/skipped). Changes: - database.py: Add 'skipped' status to job_details CHECK constraint - job_manager.py: Update completion logic to count skipped as done - job_manager.py: Add skipped count to progress tracking - simulation_worker.py: Implement skip tracking with per-model granularity - simulation_worker.py: Add _filter_completed_dates_with_tracking() - simulation_worker.py: Add _mark_skipped_dates() - simulation_worker.py: Update _prepare_data() to use skip tracking - simulation_worker.py: Improve warning messages to distinguish skip types Skip reasons: - "Already completed" - Position data exists from previous job - "Incomplete price data" - Missing prices (weekends/holidays/future) The implementation correctly handles multi-model scenarios where different models have different completion states for the same date.	2025-11-02 09:35:58 -05:00
Bill	aa4958bd9c	fix: use config models when empty models list provided When the trigger simulation API receives an empty models list ([]), it now correctly falls back to enabled models from config instead of running with no models. Changes: - Update condition to check for both None and empty list - Add test case for empty models list behavior - Update API documentation to clarify this behavior All 28 integration tests pass.	2025-11-02 09:07:58 -05:00
Bill	34d3317571	fix: correct BaseAgent initialization parameters in ModelDayExecutor Fixed incorrect parameter passing to BaseAgent.__init__(): - Changed model_name to basemodel (correct parameter name) - Removed invalid config parameter - Properly mapped all configuration values to BaseAgent parameters This resolves simulation job failures with error: "BaseAgent.__init__() got an unexpected keyword argument 'model_name'" Fixes initialization of trading agents in API simulation jobs.	2025-11-02 09:00:09 -05:00
Bill	3535746eb7	fix: simplify database migration for pre-production Remove complex table recreation logic since the server hasn't been deployed yet. For existing databases, simply delete and recreate. The dev database is already recreated on startup by design. Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-02 07:23:58 -05:00
Bill	a42487794f	feat(api): return warnings in /simulate/status response Parse and return job warnings from database. Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-02 00:13:39 -04:00
Bill	139a016a4d	refactor(api): remove price download from /simulate/trigger Move data preparation to background worker: - Fast endpoint response (<1s) - No blocking downloads - Worker handles data download and filtering - Maintains backwards compatibility Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-02 00:10:12 -04:00
Bill	5e5354e2af	feat(worker): integrate data preparation into run() method Call _prepare_data before executing trades: - Download missing data if needed - Filter completed dates - Store warnings - Handle empty date scenarios Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 23:49:24 -04:00
Bill	8c3e08a29b	feat(worker): add _prepare_data method Orchestrate data preparation phase: - Check missing data - Download if needed - Filter completed dates - Update job status Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 23:43:49 -04:00
Bill	445183d5bf	feat(worker): add _add_job_warnings helper method Delegate to JobManager.add_job_warnings for storing warnings. Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 23:31:34 -04:00
Bill	2ab78c8552	feat(worker): add _filter_completed_dates helper method Implement idempotent behavior by skipping already-completed model-days. Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 23:30:09 -04:00
Bill	88a3c78e07	feat(worker): add _download_price_data helper method Handle price data download with rate limit detection and warning generation. Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 23:29:00 -04:00
Bill	a478165f35	feat(api): add warnings field to response models Add optional warnings field to: - SimulateTriggerResponse - JobStatusResponse Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 23:25:03 -04:00
Bill	05c2480ac4	feat(api): add JobManager.add_job_warnings method Store job warnings as JSON array in database. Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 23:20:50 -04:00
Bill	baa44c208a	fix: add migration logic for warnings column and update tests Critical fixes identified in code review: 1. Add warnings column migration to _migrate_schema() - Checks if warnings column exists in jobs table - Adds column via ALTER TABLE if missing - Ensures existing databases get new column on upgrade 2. Document CHECK constraint limitation - Added docstring explaining ALTER TABLE cannot add CHECK constraints - Notes that "downloading_data" status requires fresh DB or manual migration 3. Add comprehensive migration tests - test_migration_adds_warnings_column: Verifies warnings column migration - test_migration_adds_simulation_run_id_column: Tests existing migration - Both tests include cleanup to prevent cross-test contamination 4. Update test fixtures and expectations - Updated clean_db fixture to delete from all 9 tables - Fixed table count assertions (6 -> 9 tables) - Updated expected columns in schema tests All 21 database tests now pass.	2025-11-01 23:17:25 -04:00
Bill	711ae5df73	feat(db): add downloading_data status and warnings column Add support for: - downloading_data job status for visibility during data prep - warnings TEXT column for storing job-level warnings (JSON array) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 23:10:01 -04:00
Bill	73c0fcd908	fix: ensure DEV mode warning appears in Docker logs on startup - Add FastAPI @app.on_event("startup") handler to display warning - Previously only appeared when running directly (not via uvicorn) - Add DEPLOYMENT_MODE and PRESERVE_DEV_DATA to docker-compose.yml - Update CHANGELOG.md with fix documentation Fixes issue where dev mode banner wasn't visible in Docker logs because uvicorn imports app without executing __main__ block.	2025-11-01 13:40:15 -04:00
Bill	7aa93af6db	feat: add resume mode and idempotent behavior to /simulate/trigger endpoint BREAKING CHANGE: end_date is now required and cannot be null/empty New Features: - Resume mode: Set start_date to null to continue from last completed date per model - Idempotent by default: Skip already-completed dates with replace_existing=false - Per-model independence: Each model resumes from its own last completed date - Cold start handling: If no data exists in resume mode, runs only end_date as single day API Changes: - start_date: Now optional (null enables resume mode) - end_date: Now REQUIRED (cannot be null or empty string) - replace_existing: New optional field (default: false for idempotent behavior) Implementation: - Added JobManager.get_last_completed_date_for_model() method - Added JobManager.get_completed_model_dates() method - Updated create_job() to support model_day_filter for selective task creation - Fixed bug with start_date=None in price data checks Documentation: - Updated API_REFERENCE.md with complete examples and behavior matrix - Updated QUICK_START.md with resume mode examples - Updated docs/user-guide/using-the-api.md - Added CHANGELOG_NEW_API.md with migration guide - Updated all integration tests for new schema - Updated client library examples (Python, TypeScript) Migration: - Old: {"start_date": "2025-01-16"} - New: {"start_date": "2025-01-16", "end_date": "2025-01-16"} - Resume: {"start_date": null, "end_date": "2025-01-31"} See CHANGELOG_NEW_API.md for complete details.	2025-11-01 13:34:20 -04:00
Bill	b9353e34e5	feat: add prominent startup warning for DEV mode Add comprehensive warning display when server starts in development mode to ensure users are aware of simulated AI calls and data handling. Changes: - Add log_dev_mode_startup_warning() function in deployment_config.py - Display warning on main.py startup when DEPLOYMENT_MODE=DEV - Display warning on API server startup (api/main.py) - Warning shows AI simulation status and data persistence behavior - Provides clear instructions for switching to PROD mode The warning is highly visible and informs users that: - AI API calls are simulated (no costs incurred) - Data may be reset between runs (based on PRESERVE_DEV_DATA) - System is using isolated dev database and paths Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-01 12:57:54 -04:00
Bill	6e9c0b4971	feat: add deployment_mode flag to API responses	2025-11-01 11:31:49 -04:00
Bill	837962ceea	feat: integrate deployment mode path resolution in database module	2025-11-01 11:22:03 -04:00
Bill	8fb2ead8ff	feat: add dev database initialization and cleanup functions	2025-11-01 11:20:15 -04:00
Bill	2575e0c12a	fix: add database schema migration for simulation_run_id column Add automatic schema migration to handle existing databases that don't have the simulation_run_id column in the positions table. Problem: - v0.3.0-alpha.3 databases lack simulation_run_id column - CREATE TABLE IF NOT EXISTS doesn't add new columns to existing tables - Index creation fails with "no such column: simulation_run_id" Solution: - Add _migrate_schema() function to detect and migrate old schemas - Check if positions table exists and inspect its columns - ALTER TABLE to add simulation_run_id if missing - Run migration before creating indexes This allows seamless upgrades from alpha.3 to alpha.4 without manual database deletion or migration scripts. Fixes docker compose startup error: sqlite3.OperationalError: no such column: simulation_run_id Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-31 18:41:38 -04:00
Bill	c3ea358a12	test: add comprehensive test suite for v0.3.0 on-demand price downloads Add 64 new tests covering date utilities, price data management, and on-demand download workflows with 100% coverage for date_utils and 85% coverage for price_data_manager. New test files: - tests/unit/test_date_utils.py (22 tests) * Date range expansion and validation * Max simulation days configuration * Chronological ordering and boundary checks * 100% coverage of api/date_utils.py - tests/unit/test_price_data_manager.py (33 tests) * Initialization and configuration * Symbol date retrieval and coverage detection * Priority-based download ordering * Rate limit and error handling * Data storage and coverage tracking * 85% coverage of api/price_data_manager.py - tests/integration/test_on_demand_downloads.py (10 tests) * End-to-end download workflows * Rate limit handling with graceful degradation * Coverage tracking and gap detection * Data validation and filtering Code improvements: - Add DownloadError exception class for non-rate-limit failures - Update all ValueError raises to DownloadError for consistency - Add API key validation at download start - Improve response validation to check for Meta Data Test coverage: - 64 tests passing (54 unit + 10 integration) - api/date_utils.py: 100% coverage - api/price_data_manager.py: 85% coverage - Validates priority-first download strategy - Confirms graceful rate limit handling - Verifies database storage and retrieval Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-31 17:13:03 -04:00
Bill	76b946449e	feat: implement date range API and on-demand downloads (WIP phase 2) Phase 2 progress - API integration complete. API Changes: - Replace date_range (List[str]) with start_date/end_date (str) - Add automatic end_date defaulting to start_date for single day - Add date format validation - Integrate PriceDataManager for on-demand downloads - Add rate limit handling (trusts provider, no pre-config) - Validate date ranges with configurable max days (MAX_SIMULATION_DAYS) New Modules: - api/date_utils.py - Date validation and expansion utilities - scripts/migrate_price_data.py - Migration script for merged.jsonl API Flow: 1. Validate date range (start <= end, max 30 days, not future) 2. Check missing price data coverage 3. Download missing data if AUTO_DOWNLOAD_PRICE_DATA=true 4. Priority-based download (maximize date completion) 5. Create job with available trading dates 6. Graceful handling of partial data (rate limits) Configuration: - AUTO_DOWNLOAD_PRICE_DATA (default: true) - MAX_SIMULATION_DAYS (default: 30) - No rate limit configuration needed Still TODO: - Update tools/price_tools.py to read from database - Implement simulation run tracking - Update .env.example - Comprehensive testing - Documentation updates Breaking Changes: - API request format changed (date_range -> start_date/end_date) - This completes v0.3.0 preparation	2025-10-31 16:40:50 -04:00
Bill	bddf4d8b72	feat: add price data management infrastructure (WIP) Phase 1 of v0.3.0 date range and on-demand download implementation. Database changes: - Add price_data table (OHLCV data, replaces merged.jsonl) - Add price_data_coverage table (track downloaded date ranges) - Add simulation_runs table (soft delete support) - Add simulation_run_id to positions table - Add comprehensive indexes for new tables New modules: - api/price_data_manager.py - Priority-based download manager - Coverage gap detection - Smart download prioritization (maximize date completion) - Rate limit handling with retry logic - Alpha Vantage integration Configuration: - configs/nasdaq100_symbols.json - NASDAQ 100 constituent list Next steps (not yet implemented): - Migration script for merged.jsonl -> price_data - Update API models (start_date/end_date) - Update tools/price_tools.py to read from database - Simulation run tracking implementation - API integration - Tests and documentation This is work in progress for the v0.3.0 release.	2025-10-31 16:37:14 -04:00
Bill	8e7e80807b	refactor: remove config_path from API interface Makes config_path an internal server detail rather than an API parameter. Changes: - Remove config_path from SimulateTriggerRequest - Add config_path parameter to create_app() with default - Store in app.state.config_path for internal use - Update trigger endpoint to use internal config path - Change missing config error from 400 to 500 (server error) API calls now only need to specify date_range (and optionally models): POST /simulate/trigger {"date_range": ["2025-01-16"]} The server uses configs/default_config.json by default. This simplifies the API and hides implementation details from clients.	2025-10-31 15:18:56 -04:00
Bill	ec2a37e474	feat: use enabled field from config to determine which models run Changed the API to respect the 'enabled' field in model configurations, rather than requiring models to be explicitly specified in API requests. Changes: - Make 'models' parameter optional in POST /simulate/trigger - If models not provided, read config and use enabled models - If models provided, use as explicit override (for testing) - Raise error if no enabled models found and none specified - Update response message to show model count Behavior: - Default: Only runs models with "enabled": true in config - Override: Can still specify models in request for manual testing - Safety: Prevents accidental execution of disabled/expensive models Example before (required): POST /simulate/trigger {"config_path": "...", "date_range": [...], "models": ["gpt-4"]} Example after (optional): POST /simulate/trigger {"config_path": "...", "date_range": [...]} # Uses models where enabled: true This makes the config file the source of truth for which models should run, while still allowing ad-hoc overrides for testing.	2025-10-31 15:12:11 -04:00
Bill	fb9583b374	feat: transform to REST API service with SQLite persistence (v0.3.0) Major architecture transformation from batch-only to API service with database persistence for Windmill integration. ## REST API Implementation - POST /simulate/trigger - Start simulation jobs - GET /simulate/status/{job_id} - Monitor job progress - GET /results - Query results with filters (job_id, date, model) - GET /health - Service health checks ## Database Layer - SQLite persistence with 6 tables (jobs, job_details, positions, holdings, reasoning_logs, tool_usage) - Foreign key constraints with cascade deletes - Replaces JSONL file storage ## Backend Components - JobManager: Job lifecycle management with concurrency control - RuntimeConfigManager: Thread-safe isolated runtime configs - ModelDayExecutor: Single model-day execution engine - SimulationWorker: Date-sequential, model-parallel orchestration ## Testing - 102 unit and integration tests (85% coverage) - Database: 98% coverage - Job manager: 98% coverage - API endpoints: 81% coverage - Pydantic models: 100% coverage - TDD approach throughout ## Docker Deployment - Dual-mode: API server (persistent) + batch (one-time) - Health checks with 30s interval - Volume persistence for database and logs - Separate entrypoints for each mode ## Validation Tools - scripts/validate_docker_build.sh - Build validation - scripts/test_api_endpoints.sh - Complete API testing - scripts/test_batch_mode.sh - Batch mode validation - DOCKER_API.md - Deployment guide - TESTING_GUIDE.md - Testing procedures ## Configuration - API_PORT environment variable (default: 8080) - Backwards compatible with existing configs - FastAPI, uvicorn, pydantic>=2.0 dependencies Co-Authored-By: AI Assistant <noreply@example.com>	2025-10-31 11:47:10 -04:00
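Note on fb32bb12c5 (check column existence before creating indexes): the guard described in that commit message can be sketched with the standard-library sqlite3 module. This is a minimal illustration, not the repository's code; the helper names `column_exists` and `create_index_if_column_exists` and the index name `idx_positions_session_id` are hypothetical. Only the `positions` table and `session_id` column come from the commit message.

```python
import sqlite3


def column_exists(conn: sqlite3.Connection, table: str, column: str) -> bool:
    """Return True if `table` currently has a column named `column`."""
    # PRAGMA table_info returns one row per column; field 1 is the column name.
    # The table name is interpolated, so it must be a trusted identifier.
    rows = conn.execute(f"PRAGMA table_info({table})").fetchall()
    return any(row[1] == column for row in rows)


def create_index_if_column_exists(
    conn: sqlite3.Connection, index: str, table: str, column: str
) -> bool:
    """Create the index only when the column is present; report whether it ran."""
    if not column_exists(conn, table, column):
        return False
    conn.execute(f"CREATE INDEX IF NOT EXISTS {index} ON {table}({column})")
    return True


conn = sqlite3.connect(":memory:")

# Fresh database: the table is created without session_id.
conn.execute("CREATE TABLE positions (id INTEGER PRIMARY KEY, model TEXT)")
skipped = create_index_if_column_exists(
    conn, "idx_positions_session_id", "positions", "session_id"
)

# After the migration step adds the column, the same call succeeds.
conn.execute("ALTER TABLE positions ADD COLUMN session_id INTEGER")
created = create_index_if_column_exists(
    conn, "idx_positions_session_id", "positions", "session_id"
)

index_names = [
    row[0]
    for row in conn.execute("SELECT name FROM sqlite_master WHERE type = 'index'")
]
```

With this guard the index step is safe to run in either order relative to the migration: it does nothing while the column is missing and creates the index once the column exists.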

42 Commits
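
Note on 1f41e9d7ca and 68aaa013b0 (skip status tracking): those commits describe a job as complete once every model-day reaches a terminal state (completed, failed, or skipped). A rough sketch of that completion rule follows. The function names and the shape of the progress dictionary are assumptions made for illustration and are not taken from job_manager.py.

```python
from collections import Counter

# Terminal states named in the commit messages.
TERMINAL_STATES = {"completed", "failed", "skipped"}


def job_is_finished(detail_statuses: list[str]) -> bool:
    """True when every model-day status is terminal.

    Assumption: an empty list counts as finished here (all() on an
    empty iterable is True); the real project may treat it differently.
    """
    return all(status in TERMINAL_STATES for status in detail_statuses)


def progress(detail_statuses: list[str]) -> dict[str, int]:
    """Count model-days per terminal state, plus those still outstanding."""
    counts = Counter(detail_statuses)
    done = sum(counts[state] for state in TERMINAL_STATES)
    return {
        "total": len(detail_statuses),
        "completed": counts["completed"],
        "failed": counts["failed"],
        "skipped": counts["skipped"],
        "remaining": len(detail_statuses) - done,
    }


# A job whose remaining dates were filtered out no longer hangs:
statuses = ["completed", "skipped", "skipped", "failed"]
finished = job_is_finished(statuses)
summary = progress(statuses)

# A job with a model-day still running is not finished.
still_running = job_is_finished(["completed", "running", "skipped"])
```

Counting skipped alongside completed and failed is what lets a job finish when some dates are filtered out as already completed or lacking price data.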