monvir/ai-hackaton-backend

Mikhail Kraevskii ab2b417d78 Agent lifecycle

2025-09-05 00:31:31 +03:00

4.9 KiB

CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Common Development Commands

Application Startup

# Start FastAPI server
uvicorn app.main:app --reload --port 8000

# Start Celery worker (required for resume processing)
celery -A celery_worker.celery_app worker --loglevel=info

# Start LiveKit server (for voice interviews)
docker run --rm -p 7880:7880 -p 7881:7881 livekit/livekit-server --dev

Database Management

# Run database migrations
alembic upgrade head

# Create new migration
alembic revision --autogenerate -m "description"

Code Quality

# Format code (ruff format does not sort imports)
ruff format .

# Lint and auto-fix issues (also sorts imports when the isort "I" rules are enabled)
ruff check . --fix

# Type checking
mypy .

Testing

# Run basic system tests
python simple_test.py

# Run comprehensive tests
python test_system.py

# Test agent integration
python test_agent_integration.py

# Run pytest suite
pytest

Architecture Overview

Core Components

FastAPI Application (app/):

main.py: Application entry point with middleware and router configuration
routers/: API endpoints organized by domain (resume, interview, vacancy, admin)
models/: SQLModel database schemas with enums and relationships
services/: Business logic layer handling complex operations
repositories/: Data access layer using SQLModel/SQLAlchemy

Background Processing (celery_worker/):

celery_app.py: Celery configuration with Redis backend
tasks.py: Asynchronous tasks for resume parsing and interview analysis
interview_analysis_task.py: Specialized task for processing interview results

AI Interview System:

ai_interviewer_agent.py: LiveKit-based voice interview agent using OpenAI, Deepgram, and Cartesia
app/services/agent_manager.py: Singleton manager for controlling the AI agent lifecycle
Agent runs as a single process, handling one interview at a time (hackathon limitation)
Inter-process communication via JSON command files
Automatic startup/shutdown with FastAPI application lifecycle
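
The JSON command-file exchange described above can be sketched as follows. This is a minimal illustration, not the repository's implementation: the `write_command` and `read_command` helpers and the `action` / `session_id` / `timestamp` fields are hypothetical, since the actual layout of the command files is not shown here. The write goes through a temporary file and `os.replace` so that a reader polling the file never sees a half-written document.

```python
import json
import os
import tempfile
import time
from pathlib import Path
from typing import Optional


def write_command(path: Path, action: str, session_id: Optional[int] = None) -> None:
    """Atomically write a command for the agent process to pick up."""
    # Hypothetical schema: the real command file layout may differ.
    payload = {"action": action, "session_id": session_id, "timestamp": time.time()}
    # Write to a temp file in the same directory, then rename it over the target.
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, suffix=".tmp")
    with os.fdopen(fd, "w", encoding="utf-8") as tmp:
        json.dump(payload, tmp)
    os.replace(tmp_name, path)


def read_command(path: Path) -> Optional[dict]:
    """Return the pending command, or None if the file is missing or unreadable."""
    try:
        return json.loads(path.read_text(encoding="utf-8"))
    except (FileNotFoundError, json.JSONDecodeError):
        return None
```

Under these assumptions, the FastAPI side would call `write_command(Path("agent_commands.json"), "start_session", 42)` and the agent loop would call `read_command` on the same path at a fixed interval.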

RAG System (rag/):

vector_store.py: Milvus vector database integration for resume search
llm/model.py: OpenAI GPT integration for resume parsing and interview plan generation
service/model.py: RAG service orchestration

Database Schema

Key Models:

Resume: Candidate resumes with parsing status, interview plans, and file storage
InterviewSession: LiveKit rooms with AI agent process tracking
Vacancy: Job postings with requirements and descriptions
Session: User session management with cookie-based tracking

Status Enums:

ResumeStatus: pending → parsing → parsed → interview_scheduled → interviewed
InterviewStatus: created → active → completed/failed
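
As an illustration of the ResumeStatus progression, the sketch below models the five states as a string enum and checks that a transition moves exactly one step forward. The member names and the `can_advance` helper are assumptions made for this example; the actual models in app/models/ may define additional states (for example, failure states) or different rules.

```python
from enum import Enum


class ResumeStatus(str, Enum):
    PENDING = "pending"
    PARSING = "parsing"
    PARSED = "parsed"
    INTERVIEW_SCHEDULED = "interview_scheduled"
    INTERVIEWED = "interviewed"


# Linear order taken from the arrows above; enum iteration follows definition order.
_ORDER = list(ResumeStatus)


def can_advance(current: ResumeStatus, new: ResumeStatus) -> bool:
    """True only when `new` is the step immediately after `current`."""
    return _ORDER.index(new) - _ORDER.index(current) == 1
```

With this helper, `can_advance(ResumeStatus.PENDING, ResumeStatus.PARSING)` is accepted, while skipping a step or moving backwards is rejected.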

External Dependencies

Services (required unless marked optional):

PostgreSQL: Primary database with asyncpg driver
Redis: Celery broker and caching layer
Milvus: Vector database for semantic search (optional, has fallbacks)
S3-compatible storage: Resume file storage

API Keys:

OpenAI: Required for resume parsing and LLM operations
Deepgram/Cartesia/ElevenLabs: Optional voice services (have fallbacks)
LiveKit credentials: For interview functionality

Development Workflow

Resume Processing Flow

File upload via /api/v1/resume/upload
Celery task processes file and extracts text
OpenAI parses resume data and generates interview plan
Vector embeddings stored in Milvus for search
Status updates tracked through enum progression
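
Because processing happens in a Celery task, a client has to wait for the status to reach `parsed` after uploading. The helper below is a rough sketch of such a polling loop. `wait_for_status` is a hypothetical name, and `fetch_status` stands in for whatever call returns the current status string, because the status endpoint is not listed in this file.

```python
import time
from typing import Callable


def wait_for_status(
    fetch_status: Callable[[], str],
    target: str = "parsed",
    interval: float = 2.0,
    timeout: float = 120.0,
    sleep: Callable[[float], None] = time.sleep,
) -> bool:
    """Poll fetch_status() until it returns `target` or the timeout would be exceeded."""
    deadline = time.monotonic() + timeout
    while True:
        if fetch_status() == target:
            return True
        # Give up if the next sleep would carry us past the deadline.
        if time.monotonic() + interval > deadline:
            return False
        sleep(interval)
```

The `sleep` parameter exists only so the loop can be exercised without real delays. Note that a resume which has already moved past `parsed` will never match this target, so a production version would probably accept a set of acceptable states.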

Interview System Flow

AI agent starts automatically with FastAPI application
Validate resume readiness via /api/v1/interview/{id}/validate
Check agent availability (singleton, one interview at a time)
Generate LiveKit token via /api/v1/interview/{id}/token
Assign interview session to agent via command files
Conduct real-time voice interview through LiveKit
Agent monitors for end commands or natural completion
Session cleanup and agent returns to idle state
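
The availability check, assignment, and return-to-idle steps in this flow amount to guarding a single slot. The class below is a simplified in-memory sketch of that idea. `AgentSlot` and its methods are hypothetical and are not the API of app/services/agent_manager.py, which also has to manage the agent process itself.

```python
import threading
from typing import Optional


class AgentSlot:
    """Tracks which interview session, if any, the single agent is serving."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._session_id: Optional[int] = None

    def is_available(self) -> bool:
        with self._lock:
            return self._session_id is None

    def assign(self, session_id: int) -> bool:
        """Claim the agent for a session; returns False if it is already busy."""
        with self._lock:
            if self._session_id is not None:
                return False
            self._session_id = session_id
            return True

    def release(self, session_id: int) -> bool:
        """Return the agent to idle, but only for the session that holds it."""
        with self._lock:
            if self._session_id != session_id:
                return False
            self._session_id = None
            return True
```

Doing the check and the claim inside one locked `assign` call avoids the race in which two requests both see the agent as idle and then both try to take it.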

Configuration Management

Settings via app/core/config.py with Pydantic BaseSettings
Environment variables loaded from .env file (see .env.example)
Database URLs and API keys configured per environment

Important Notes

AI agent runs as a singleton process, handling one interview at a time
Agent lifecycle is managed automatically with FastAPI startup/shutdown
Interview sessions require LiveKit server to be running
Agent communication happens via JSON files (agent_commands.json, session_metadata_*.json)
Resume parsing is asynchronous; poll the resume status to track progress
Vector search gracefully degrades if Milvus is unavailable
Session management uses custom middleware with cookie-based tracking

Agent Management API

# Check agent status
GET /api/v1/admin/agent/status

# Start/stop/restart agent manually
POST /api/v1/admin/agent/start
POST /api/v1/admin/agent/stop
POST /api/v1/admin/agent/restart
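
For scripting against these endpoints, a small standard-library client might look like the sketch below. The base URL is assumed from the `uvicorn ... --port 8000` command, the helper names are hypothetical, and the code assumes the endpoints return JSON and need no authentication, none of which is confirmed by this file.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # assumption: matches the uvicorn --port 8000 command


def build_agent_request(action: str, base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a request for one of the agent admin endpoints."""
    if action not in {"status", "start", "stop", "restart"}:
        raise ValueError(f"unknown agent action: {action!r}")
    # Status is read-only (GET); the other actions change agent state (POST).
    method = "GET" if action == "status" else "POST"
    return urllib.request.Request(f"{base_url}/api/v1/admin/agent/{action}", method=method)


def call_agent(action: str, base_url: str = BASE_URL) -> dict:
    """Send the request and decode the body, assuming the API responds with JSON."""
    with urllib.request.urlopen(build_agent_request(action, base_url), timeout=10) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

With the server running, `call_agent("status")` would fetch the agent state and `call_agent("restart")` would trigger a restart.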
