Planning Roadmap - huqianghui/AI-Coach-vibe-coding GitHub Wiki

Project Roadmap

Auto-generated from .planning/ROADMAP.md
Last synced: 2026-04-02

Roadmap: AI Coach Platform (BeiGene)

Overview

This roadmap delivers the AI Coach Platform in 6 phases: foundation and architecture first, then F2F text coaching (the core value loop), scoring & assessment enhancements, dashboards & reporting, training material management, and conference presentation module. Each phase builds on the previous one and delivers a coherent, demonstrable capability. Architecture-first principle applies throughout -- pluggable adapters, config-driven features, and shared components are established in Phase 01 and extended by every subsequent phase.

Phases

Phase Numbering:

Zero-padded phases (01, 02, 03): Planned milestone work
Decimal phases (01.1, 02.1): Urgent insertions (marked with INSERTED)

Decimal phases appear between their surrounding integers in numeric order.

Phase 01: Foundation, Auth, and Design System - Pluggable architecture, authentication, shared UI components, i18n, and responsive app shell
Phase 02: F2F Text Coaching and Scoring - HCP/scenario configuration, text-based coaching simulation, multi-dimensional scoring, and feedback
Phase 03: Scoring & Assessment - Real-time coaching suggestions, post-session reports, customizable scoring rubrics
Phase 04: Dashboard & Reporting - Personal dashboard, group analytics, export (PDF/Excel), training progress tracking
Phase 05: Training Material Management - Document upload, versioning, retention policies, AI knowledge base integration
Phase 06: Conference Presentation Module - One-to-many simulation, live transcription, audience Q&A, presentation scoring (completed 2026-03-25)
Phase 07: Azure Service Integration - Admin Azure config persistence, real connection testing, dynamic provider switching (mock → Azure OpenAI/Speech/Avatar) (completed 2026-03-27)
Phase 08: Voice & Avatar Demo Integration - Integrate Azure Voice Live Agent with Avatar into the AI Coach platform for real-time voice coaching with digital HCP avatar (completed 2026-03-28)
Phase 09: Integration Testing with Real Azure Services - Unified AI Foundry config, 7 interaction modes, agent mode runtime, integration tests, E2E demo validation (completed 2026-03-29)
Phase 10: UI Polish & Professional Unification - Comprehensive UI overhaul for professional appearance, unified design language, polished visuals for BeiGene customer demo (completed 2026-03-29)
Phase 11: HCP Profile Agent Integration - Auto-create AI Foundry agent when adding HCP profiles, bidirectional sync, table UI redesign (completed 2026-03-31)

Phase Details

Phase 01: Foundation, Auth, and Design System

Goal: A running application with login, responsive layout shell, shared component library, i18n framework, and pluggable architecture for all AI services -- the scaffold everything else builds on Depends on: Nothing (first phase) Requirements: ARCH-01, ARCH-02, ARCH-03, ARCH-04, ARCH-05, AUTH-01, AUTH-02, AUTH-03, AUTH-04, UI-01, UI-02, UI-07, PLAT-01, PLAT-02, PLAT-04, PLAT-05 Success Criteria (what must be TRUE):

User can log in with username/password and see a responsive app shell with sidebar navigation -- session persists across browser refresh
Admin and User roles exist -- admin sees admin routes, user does not
All UI text is externalized via react-i18next and the app can switch between zh-CN and en-US
AI service adapters (LLM, STT, TTS, Avatar) use pluggable provider pattern -- a mock provider works end-to-end without any Azure credentials
Feature toggles, Azure service endpoints, voice mode selection, and region configuration are driven by config (not hardcoded) -- changing config changes behavior without code changes Plans: 5 plans

Plans:

01-01-PLAN.md -- Backend JWT auth: User model, login/me endpoints, role-based access, seed data
01-02-PLAN.md -- Design tokens + UI component library: Figma Make theme adaptation, 17 shadcn/ui components
01-03-PLAN.md -- Pluggable AI adapters + config: STT/TTS/Avatar base+mock, ServiceRegistry, feature toggles, config API
01-04-PLAN.md -- Frontend shell: React bootstrap, i18n, login page, user/admin layouts, router with auth guards
01-05-PLAN.md -- Integration wiring: auto-register adapters, config context, integration tests, full verification

UI hint: yes

Phase 01.1: UI Figma Alignment (INSERTED)

Goal: Align existing frontend with 5 Figma Make generated screens -- login polish, full user dashboard, scenario selection page, F2F training session page, and 11 new shared domain components. All pages use i18n, design tokens, and mock data (backend integration deferred to Phase 2). Requirements: UI-01, UI-02, UI-03, UI-04, UI-05 Depends on: Phase 1 Plans: 4/6 plans executed

Plans:

01.1-01-PLAN.md -- Install Radix deps, 4 new UI base components (ScrollArea, Tabs, Progress, Textarea), i18n namespaces
01.1-02-PLAN.md -- Login page polish: SVG logo, card shadow, auth layout gradient/language switcher/copyright
01.1-03-PLAN.md -- 11 shared domain components: StatCard, SessionItem, ActionCard, HCPProfileCard, ChatBubble, ChatInput, etc.
01.1-04-PLAN.md -- User dashboard page + scenario selection page + route registration
01.1-05-PLAN.md -- F2F training session: 3 coach panels + full-screen training page + route registration
01.1-06-PLAN.md -- Build validation + visual verification checkpoint

Phase 02: F2F Text Coaching and Scoring

Goal: An MR can select a scenario, have a text-based F2F conversation with an AI HCP that behaves according to its profile, and receive a multi-dimensional scored feedback report after the session Depends on: Phase 01 Requirements: HCP-01, HCP-02, HCP-03, HCP-04, HCP-05, COACH-01, COACH-02, COACH-03, COACH-08, COACH-09, SCORE-01, SCORE-02, SCORE-03, SCORE-04, SCORE-05, UI-03, UI-05, PLAT-03 Success Criteria (what must be TRUE):

Admin can create HCP profiles (personality, specialty, objections, communication style) and training scenarios (product, key messages, difficulty, scoring weights) from the web UI
User can browse and select a training scenario, then start a text-based F2F coaching session with the assigned AI HCP
AI HCP responds in character (personality, knowledge, objections) based on its profile -- conversation feels realistic and contextual
During the session, a side panel shows real-time key message delivery checklist and coaching hints
After session completion, user sees a multi-dimensional scoring report with per-dimension scores, strengths/weaknesses with conversation quotes, and actionable improvement suggestions Plans: 8 plans

Plans:

02-01-PLAN.md -- Backend data models, Pydantic schemas, Alembic migration, sse-starlette dependency
02-02-PLAN.md -- Frontend TypeScript types, i18n namespaces (coach/admin/scoring), Slider component, recharts install
02-03-PLAN.md -- Backend HCP profile + scenario CRUD API routers, service layer, seed data
02-04-PLAN.md -- Backend session lifecycle + SSE streaming chat + scoring service + enhanced mock adapter
02-05-PLAN.md -- Frontend API client modules + TanStack Query hooks + SSE streaming hook
02-06-PLAN.md -- Admin pages: HCP profile management, scenario management, Azure config
02-07-PLAN.md -- User pages: scenario selection, F2F coaching session with live chat, scoring feedback with radar chart
02-08-PLAN.md -- Integration wiring: router, admin sidebar, Azure config API, full flow verification

UI hint: yes

Phase 03: Scoring & Assessment

Goal: Complete the scoring system with real-time coaching suggestions during sessions, detailed post-session reports with strengths/weaknesses/improvement areas, and admin-customizable scoring criteria/rubrics Depends on: Phase 02 Requirements: SCORE-01, SCORE-02, SCORE-03, SCORE-04, SCORE-05, COACH-08, COACH-09 Success Criteria (what must be TRUE):

During a coaching session, the system provides real-time suggestions and coaching tips in the side panel based on conversation context
After session completion, user sees a detailed post-session report with strengths, weaknesses, conversation quotes, and actionable improvement areas
Admin can configure customizable scoring rubrics -- defining dimensions, weights, and criteria per scenario type
Scoring results are persisted and queryable for historical trend analysis
All new code has unit tests with >=95% coverage maintained Plans: 4 plans

Plans:

03-01-PLAN.md -- Backend services: rubric CRUD, scoring rubric integration, SSE suggestion wiring, report/suggestions/history endpoints, tests
03-02-PLAN.md -- Frontend data layer: TypeScript types, API clients, TanStack Query hooks for rubrics/reports/history
03-03-PLAN.md -- Frontend pages: admin rubric management, enhanced scoring feedback with full report + PDF, session history
03-04-PLAN.md -- Integration wiring: router registration, sidebar nav, seed default rubric, full flow verification

UI hint: yes

Phase 04: Dashboard & Reporting

Goal: MRs can track their improvement over time via a personal dashboard, and admins can view organization-level analytics with export capabilities Depends on: Phase 03 Requirements: UI-04, UI-06, ANLYT-01, ANLYT-02, ANLYT-03, ANLYT-04, ANLYT-05 Success Criteria (what must be TRUE):

User can view a personal dashboard with score overview, recent sessions, and a skill radar chart showing multi-dimensional performance
User can view session history (date, scenario, score, duration) and personal performance trends over time per scoring dimension
Admin can view organization-level analytics including BU comparisons, skill gap heatmaps, and training completion rates
Reports can be exported as PDF/Excel for offline review
All new code has unit tests with >=95% coverage maintained Plans: 6 plans

Plans:

04-01-PLAN.md -- Backend foundation: Alembic migration (business_unit), analytics schemas, analytics service, export service, recommendation engine
04-02-PLAN.md -- Frontend data layer: TypeScript types, API client, TanStack Query hooks, i18n analytics namespace, file-saver install
04-03-PLAN.md -- Backend API: analytics router with 7 endpoints, main.py registration, seed data with BU values
04-04-PLAN.md -- Frontend user pages: enhanced dashboard with live stats, session history with skill radar, chart components
04-05-PLAN.md -- Frontend admin pages: org analytics dashboard, reports page, BU bar chart, skill gap heatmap, route registration, backend tests
04-06-PLAN.md -- Gap closure: seed session data, wire reports pages to live data, date range filtering, PDF print export

UI hint: yes

Phase 05: Training Material Management

Goal: Admin can upload, version, and manage training materials (Word/Excel/PDF) organized by product -- materials feed into AI knowledge base for more accurate HCP simulation Depends on: Phase 02 Requirements: CONTENT-01, CONTENT-02, CONTENT-03 Success Criteria (what must be TRUE):

Admin can upload training documents (Word, Excel, PDF) organized by product via the web UI
Uploaded materials support versioning and archiving -- admin can see version history and restore previous versions
Retention policies enable auto-deletion of expired materials per configurable rules
Uploaded materials are indexed and available to the AI knowledge base for enhanced HCP simulation accuracy
All new code has unit tests with >=95% coverage maintained Plans: 3 plans

Plans:

05-01-PLAN.md -- Backend foundation: ORM models, Pydantic schemas, storage adapter, text extractor, Alembic migration, new dependencies
05-02-PLAN.md -- Backend API: material service, REST router, prompt builder RAG integration, comprehensive tests
05-03-PLAN.md -- Frontend: TypeScript types, API client, TanStack Query hooks, admin page with drag-and-drop upload, i18n, route registration

UI hint: yes

Phase 06: Conference Presentation Module

Goal: MRs can practice conference presentations to multiple virtual HCP audience members with turn management, live transcription, Q&A, and multi-scenario scoring Depends on: Phase 02 Requirements: CONF-01, CONF-02, CONF-03, CONF-04, COACH-04, COACH-05, COACH-06, COACH-07 Success Criteria (what must be TRUE):

User can start a conference presentation session with multiple virtual HCP audience members (one-to-many simulation)
Live transcription displays audio-to-text on screen during the presentation
Virtual HCP audience members ask contextual questions with turn management
Conference presentations are scored using the multi-dimensional scoring system with presentation-specific criteria
All new code has unit tests with >=95% coverage maintained Plans: 6 plans

Plans:

06-01-PLAN.md -- Backend foundation: ConferenceAudienceHcp model, session/message extensions, Alembic migration, schemas, TurnManager, voice dependency
06-02-PLAN.md -- Frontend data layer: TypeScript types, API client, TanStack Query hooks, multi-speaker SSE hook, i18n conference namespace
06-03-PLAN.md -- Backend services + API: conference_service, conference router with SSE, prompt builder extension, Azure STT/TTS adapters
06-04-PLAN.md -- Frontend components + page: 11 conference components, extended ChatBubble, full-screen conference session page
06-05-PLAN.md -- Integration wiring: route registration, navigation, seed data, admin audience config, full-flow verification
06-06-PLAN.md -- Comprehensive backend tests: TurnManager, conference service, API integration, STT/TTS adapters, schemas, models (>=95% coverage)

UI hint: yes

Phase 07: Azure Service Integration

Goal: Admin Azure config persistence, real connection testing, dynamic provider switching (mock to Azure OpenAI/Speech/Avatar) Depends on: Phase 01 Requirements: PLAT-03, ARCH-05 Success Criteria (what must be TRUE):

Admin can configure Azure service endpoints and API keys via admin UI
API keys are stored encrypted (Fernet) in the database
Connection testing validates Azure service reachability
Dynamic provider switching allows runtime change from mock to Azure providers
All new code has unit tests with >=95% coverage maintained Plans: 4 plans

Plans:

07-01-PLAN.md -- Config data foundation: ServiceConfig model, Fernet encryption, config service, schemas, migration
07-02-PLAN.md -- Admin config API routes and frontend config page
07-03-PLAN.md -- Connection testing and Azure service validation
07-04-PLAN.md -- Dynamic provider switching and runtime reconfiguration

UI hint: yes

Progress

Execution Order: Phases execute in numeric order: 01 -> 01.1 -> 02 -> 03 -> 04 -> 05 -> 06 -> 07 -> 08 -> 09 -> 10 -> 11

Phase	Plans Complete	Status	Completed
01. Foundation, Auth, and Design System	5/5	Complete	-
01.1. UI Figma Alignment	6/6	Complete	-
02. F2F Text Coaching and Scoring	8/8	Complete	-
03. Scoring & Assessment	4/4	Complete	-
04. Dashboard & Reporting	6/6	Complete	-
05. Training Material Management	3/3	Complete	-
06. Conference Presentation Module	6/6	Complete	2026-03-25
07. Azure Service Integration	4/4	Complete	2026-03-27
08. Voice & Avatar Demo Integration	5/4	Complete	2026-03-28
09. Integration Testing with Real Azure Services	5/5	Complete	2026-03-29
10. UI Polish & Professional Unification	6/6	Complete	2026-03-29
11. HCP Profile Agent Integration	3/3	Complete	2026-03-31

Phase 07: Azure Service Integration

Goal: Admin can configure Azure OpenAI, Speech, and Avatar through the web UI with real connection testing, configurations persist to the database, and the coaching system dynamically switches from mock to real Azure providers based on admin settings Depends on: Phase 02 Requirements: PLAT-03, ARCH-05, PLAT-05 Success Criteria (what must be TRUE):

Admin can configure Azure OpenAI endpoint/key/model/region from the Azure Config page and the settings persist across server restarts (stored in database)
Admin can configure Azure Speech (STT/TTS) and Azure Avatar settings from the same page
"Test Connection" button actually validates connectivity to the configured Azure service and shows real success/failure status
When Azure OpenAI is configured and tested, F2F coaching sessions use the real Azure OpenAI model instead of mock responses
When Azure Speech is configured, voice mode becomes available for coaching sessions (STT for input, TTS for HCP responses)
The system gracefully falls back to mock adapters when Azure services are not configured or unavailable Plans: 4 plans

Plans:

07-01-PLAN.md -- Backend foundation: ServiceConfig model, Fernet encryption, schemas, Alembic migration, config service
07-02-PLAN.md -- AzureOpenAIAdapter: streaming LLM adapter with conversation history, unit tests
07-03-PLAN.md -- Backend API + dynamic switching: PUT/test/GET endpoints, connection tester, lifespan DB loading, session history wiring
07-04-PLAN.md -- Frontend wiring: TypeScript types, API client, TanStack Query hooks, wire azure-config page to real API

UI hint: yes

Phase 08: Voice & Avatar Demo Integration

Goal: Integrate the existing Voice-Live-Agent-With-Avatar demo (Azure Voice Live API + Avatar) into the AI Coach platform, enabling real-time voice-based coaching sessions where MRs talk to a digital HCP avatar with natural speech interaction Depends on: Phase 07 Requirements: COACH-04, COACH-05, COACH-07, EXT-04, PLAT-05 Success Criteria (what must be TRUE):

User can start a voice-enabled coaching session that uses Azure Voice Live API for real-time speech interaction with the AI HCP
Azure AI Avatar renders a digital human visual for the HCP during voice coaching sessions
Voice interaction is integrated with the existing coaching session lifecycle (start -> in_progress -> completed -> scored)
The system gracefully falls back to text-only or TTS-only mode when Avatar/Voice Live services are unavailable
Admin can configure Voice Live and Avatar settings from the Azure Config page
All new code has unit tests with >=95% coverage maintained Plans: 4 plans

Plans:

08-01-PLAN.md -- Backend foundation: Alembic migration (session mode), voice_live schemas/service, token broker API, connection tester, tests
08-02-PLAN.md -- Frontend data layer: TypeScript types, i18n voice namespace, API client, TanStack Query hooks, audio-processor.js, tests
08-03-PLAN.md -- Voice hooks + leaf components: useVoiceLive, useAvatarStream, useAudioHandler, 7 voice UI components, component tests
08-04-PLAN.md -- Container components + wiring: VoiceSession container, route registration, admin config Voice Live card, transcript flush, tests

UI hint: yes

Phase 09: Integration Testing with Real Azure Services

Goal: Implement unified AI Foundry config (replacing 8 separate ServiceConfig rows), expand to 7 interaction modes, wire agent mode runtime end-to-end, redesign admin UI with single AI Foundry card, then validate all Azure service integrations with real credentials and polish demo experience for BeiGene customer presentations Depends on: Phase 08 Requirements: COACH-04, COACH-05, COACH-06, COACH-07, PLAT-03, PLAT-05 Success Criteria (what must be TRUE):

Admin configures a single AI Foundry endpoint/region/API key — all 7 services derive from this unified config
Platform supports all 7 interaction modes (Text, Voice Pipeline, Digital Human Speech+Model, Voice Realtime Model, Digital Human Realtime Model, Voice Realtime Agent, Digital Human Realtime Agent)
Agent mode works end-to-end: token broker returns agent_id/project_name, frontend connects via voice-agent/realtime WebSocket
Two-level mode selector UI: communication type first (Text/Voice/Digital Human), then engine (Pipeline/Realtime Model/Realtime Agent)
Integration tests validate each Azure service with real credentials (pytest --run-integration)
E2E demo flow works: Login → Admin AI Foundry config → Text session → Voice session → Avatar session → Score report Plans: 5 plans

Plans:

09-01-PLAN.md -- Backend unified AI Foundry config, 7-mode session schema, agent mode token broker
09-02-PLAN.md -- Frontend types, AI Foundry admin page redesign, two-level mode selector types
09-03-PLAN.md -- Two-level mode selector component, agent mode WebSocket wiring in use-voice-live
09-04-PLAN.md -- Backend pytest integration tests (Azure OpenAI, Speech, Voice Live, Avatar)
09-05-PLAN.md -- Playwright E2E demo-flow test, pre-demo smoke test checklist

UI hint: yes

Phase 10: UI Polish & Professional Unification

Goal: Comprehensive UI overhaul for professional appearance and consistency across all pages — unified design language, accent color theme picker, page transitions, navigation polish, Figma-audited spacing/typography, and demo-ready seed data for BeiGene customer presentations Requirements: UI-01, UI-02, UI-03, UI-04, UI-05, UI-06, UI-07 Depends on: Phase 09 Plans: 6/6 plans complete

Plans:

10-01-PLAN.md -- Theme system foundation: 5 accent color CSS themes, theme store, flash prevention, splash screen, page transition keyframes
10-02-PLAN.md -- Navigation polish: ThemePicker, Breadcrumb, PageTransition components, grouped admin sidebar, active nav states, layout dark mode
10-03-PLAN.md -- Shared component audit: design token consistency, icon sizing, Badge success variant, Sonner theming, 404 page
10-04-PLAN.md -- User page audit: login, dashboard, training, session history, scoring, reports, training sessions vs Figma specs
10-05-PLAN.md -- Admin page audit: dashboard, users, HCP profiles, scenarios, rubrics, materials, reports, azure config, settings vs Figma specs
10-06-PLAN.md -- Demo seed data polish: BeiGene products, bilingual HCPs, final build verification

Phase 11: HCP Profile Agent Integration — Auto-create AI Foundry agent when adding HCP profiles

Goal: When admin creates/updates/deletes an HCP profile, the system automatically syncs a corresponding AI Foundry Agent. Digital Human Realtime Agent mode uses the HCP's agent_id to drive conversations. HCP profiles admin page is redesigned to table format with Agent sync status. Requirements: HCP-01, HCP-02, COACH-06, COACH-07, UI-06, PLAT-01, PLAT-03 Depends on: Phase 10 Plans: 3/3 plans complete

Success Criteria (what must be TRUE):

Admin can create/update/delete HCP profiles and the system automatically creates/updates/deletes a corresponding AI Foundry Agent
Agent sync status (synced/pending/failed/none) is visible per HCP profile in the admin table with error details on hover
Failed agent sync does not prevent HCP profile save -- status shows as "failed" with retry option
Token broker returns per-HCP agent_id for Digital Human Realtime Agent mode sessions
HCP profiles page uses sortable table layout with agent status column replacing the previous list+editor layout
All new UI text externalized to i18n in both en-US and zh-CN
All new code has unit tests with >=95% coverage maintained

Plans:

11-01-PLAN.md -- Backend foundation: HcpProfile agent columns, Alembic migration, agent_sync_service (AI Foundry REST API wrapper), schema updates
11-02-PLAN.md -- Backend wiring: HCP CRUD sync hooks, retry-sync endpoint, token broker HCP agent_id sourcing, integration tests
11-03-PLAN.md -- Frontend: TypeScript types, API client, hooks, i18n keys, HcpTable component, HCP profiles page rewrite (table layout)

UI hint: yes