Features & Capabilities - sbuddharaju369/WebsiteAnalyzer GitHub Wiki
Intelligent Sidebar Navigation
Three-Panel Control System
The interface features a sophisticated collapsible sidebar with three main control panels accessed via clean icon-based buttons:
- Web Crawler Panel: Configure and launch intelligent website crawling with real-time progress tracking
- Cache Manager: Browse and reload previously analyzed websites with instant access to saved embeddings
- Content Overview: Monitor crawling statistics, website coverage metrics, and performance analytics
Real-Time Crawling Experience
Live Progress Dashboard
Watch your website analysis unfold in real-time with comprehensive progress tracking that shows pages discovered, content extracted, current page being processed, and estimated time remaining. Performance charts display crawling velocity and success rates as the system works.
Intelligent Configuration
Fine-tune crawling behavior with intuitive controls for maximum page limits (1-100), request delays (0.5-5 seconds), and automatic website size estimation that analyzes sitemaps to predict total discoverable pages before crawling begins.
Advanced Question Interface
AI-Powered Interaction Zone
The main interface features a full-width question area where you can ask natural language questions about the crawled content. Smart suggested questions are automatically generated based on the actual website content, making it easy to explore key topics without guessing what to ask.
Configurable Response Intelligence
Choose your preferred answer style with three sophistication levels: Concise (quick focused answers), Balanced (comprehensive but digestible), or Comprehensive (detailed analysis with examples). Each response includes confidence scoring and source attribution with clickable links back to original pages.
Multi-Tab Analytics Dashboard
Content Analytics Tab
View sophisticated visualizations including word count distributions across pages, site depth analysis, page relationship network graphs, and key performance metrics. Interactive charts reveal content patterns and website structure at a glance.
Semantic Search Tab
Perform direct content searches without AI interpretation. Enter keywords or phrases to find relevant page sections with similarity scoring and content previews. Results show relevance rankings and direct access to source material.
Raw Content Browser
Explore the complete crawled dataset with a page-by-page browser. Select any page to view its full extracted content, metadata, word counts, and how it was processed into chunks for analysis. Perfect for content auditing and verification.
Smart Cache Management
Human-Readable File Organization
Previously analyzed websites are automatically saved with intelligent naming like "verizon_jun-19-2025_3-45pm_25pages.json" and displayed as "verizon.com - June 19, 2025 at 3:45 PM (25 pages)" for easy identification and reloading.
Instant Reload Capability
Cached analyses include embedded OpenAI embeddings, enabling instant reactivation of the AI question-answering interface without re-processing or additional API calls.
Visual Intelligence Features
Interactive Network Graphs
Automatically generated network visualizations show page relationships and site structure with color-coded nodes sized by content volume and connected based on link relationships and content similarity.
Performance Metrics Display
Real-time counters show total pages discovered, content successfully extracted, processing speed, and coverage percentage as analysis progresses. Progress bars and charts provide immediate feedback on crawling effectiveness.
Confidence & Reliability System
Answer Quality Indicators
Every AI response includes confidence scoring displayed as user-friendly reliability indicators (Very Reliable, Mostly Reliable, etc.) with detailed explanations of what influences answer quality and tips for improving results.
Source Attribution
All answers are backed by specific page citations with relevance scores, allowing you to verify information and explore source material directly. Multiple sources are combined intelligently to provide comprehensive answers.
The interface transforms complex web crawling and AI analysis into an intuitive, visual experience that makes website intelligence accessible to non-technical users while providing the depth and control that power users demand.