Tools Reference - JinsongRoh/pydoll-mcp GitHub Wiki
PyDoll MCP Server provides 75+ powerful tools for complete browser automation. This page is a detailed reference guide for all tools.
- [🌐 Browser Management (8 tools)](#-browser-management-8-tools)
- [🧭 Navigation and Page Control (10 tools)](#-navigation-and-page-control-10-tools)
- [🎯 Element Finding and Interaction (15 tools)](#-element-finding-and-interaction-15-tools)
- [📸 Screenshots and Media (6 tools)](#-screenshots-and-media-6-tools)
- [⚡ JavaScript and Advanced Scripting (8 tools)](#-javascript-and-advanced-scripting-8-tools)
- [🛡️ Security Bypass and Stealth (12 tools)](#-security-bypass-and-stealth-12-tools)
- [🌐 Network Control and Monitoring (10 tools)](#-network-control-and-monitoring-10-tools)
- [📁 File and Data Management (8 tools)](#-file-and-data-management-8-tools)
Start Chrome/Edge browser with advanced configurations.
Usage:
Start a new Chrome browser
Start browser in headless mode
Start browser with ad blocking mode
Key Options:
-
browser_type
: Choose chrome or edge -
headless
: No screen display -
block_ads
: Ad blocking -
stealth_mode
: Detection prevention -
window_size
: Window size settings
Safely terminate browser with resource cleanup.
Usage:
Stop the browser
Close all browser windows
Create isolated tabs with custom settings.
Usage:
Open a new tab
Open a new tab in background
Open a new tab to https://example.com
Close specific tab and release resources.
Usage:
Close the current tab
Close the first tab
Display all browser instances and their status.
Usage:
Show list of open browsers
Display detailed tab information.
Usage:
Show all tab lists
Show tabs of current browser
Support smooth switching between tabs.
Usage:
Switch to the first tab
Activate the second tab
Provide comprehensive browser status report.
Usage:
Check browser status
Show browser performance information
Intelligent URL navigation with load detection.
Usage:
Navigate to https://google.com
Go to Naver
Intelligent page refresh with cache control.
Usage:
Refresh the page
Refresh page ignoring cache
Support navigation through browser history.
Usage:
Go back
Go forward
Go back 2 steps
Provide advanced page readiness detection.
Usage:
Wait until page load is complete
Wait until DOM content is loaded
Return current page URL with validation.
Usage:
Tell me the current page URL
Extract complete HTML source.
Usage:
Get page source
Show HTML code
Extract page title and metadata.
Usage:
Tell me the page title
Monitor and wait for network activity.
Usage:
Wait until network activity is complete
Provide viewport size settings for responsive design testing.
Usage:
Set viewport size to 1920x1080
Set viewport to mobile size
Provide comprehensive page analysis.
Usage:
Analyze page information
Show all page metadata
Revolutionary element finding using natural language attributes.
Usage:
Find the login button
Find the email input field
Find button with text "Search"
Supported Attributes:
-
text
: Text content -
id
: Element ID -
class_name
: CSS class -
tag_name
: HTML tag -
css_selector
: CSS selector -
xpath
: XPath expression -
placeholder
: Input field placeholder -
aria_label
: Accessibility label
Click functionality simulating human-like behavior.
Usage:
Click the login button
Double-click the search button
Right-click the menu
Click Types:
-
left
: Left click (default) -
right
: Right click -
double
: Double click -
middle
: Middle click
Provide realistic text input simulation.
Usage:
Type "Python programming" in search box
Slowly type "[email protected]" in email field
Clear password field and type "newpassword"
Options:
-
clear_first
: Clear existing text -
human_like
: Human-like typing -
typing_speed
: Typing speed control
Provide advanced keyboard input handling.
Usage:
Press Enter key
Press Ctrl+C
Press Tab key 3 times
Supported Keys:
- Regular keys:
a
,b
,1
,2
, etc. - Special keys:
Enter
,Tab
,Escape
,Space
- Arrow keys:
ArrowUp
,ArrowDown
,ArrowLeft
,ArrowRight
- Function keys:
F1
,F2
, etc. - Combination keys:
Ctrl+C
,Ctrl+V
,Alt+Tab
Provide intelligent text extraction.
Usage:
Get text from title element
Extract text from all links
Extract attribute values from elements.
Usage:
Get href attribute from link
Show all attributes of image
Provide intelligent element waiting conditions.
Usage:
Wait until loading spinner disappears
Wait until login button appears
Wait Conditions:
-
visible
: Element is visible -
hidden
: Element is hidden -
enabled
: Element is enabled -
disabled
: Element is disabled
Smooth scroll functionality with viewport management.
Usage:
Scroll to bottom button
Scroll to page center
Provide natural mouse hover simulation.
Usage:
Hover over menu item
Hover on dropdown menu
Handle dropdown and select boxes.
Usage:
Select "South Korea" from country selection
Select "Korean" from language options
Provide comprehensive visibility testing.
Usage:
Check if login button is visible
Check if error message is displayed
Support advanced drag and drop operations.
Usage:
Drag file to upload area
Drag item to new position
Provide full page capture with options.
Usage:
Take screenshot of current page
Take full page screenshot
Take viewport-only screenshot
Options:
-
full_page
: Capture full page -
viewport_only
: Capture viewport only -
format
: Choose PNG or JPEG -
quality
: JPEG quality settings
Provide precise element capture.
Usage:
Take screenshot of login form
Take screenshot of ad area
Provide professional PDF generation.
Usage:
Save current page as PDF
Generate PDF in A4 size
Provide complete page archiving.
Usage:
Save entire page as HTML
Save page including resources
Provide screen recording functionality.
Usage:
Start screen recording
Record screen for 10 seconds
Stop screen recording
Provide image extraction and processing.
Usage:
Extract all images from page
Extract only PNG images
Provide complete JavaScript execution environment.
Usage:
Change page title with JavaScript
Scroll to bottom of page
Support external library injection.
Usage:
Inject jQuery library
Add Lodash to page
Create and execute custom data extraction scripts.
Usage:
Extract all product names and prices from page
Extract news article titles and links
Automate form completion with provided data.
Usage:
Automatically fill login form
Fill registration form with data
Monitor DOM changes and trigger callbacks.
Usage:
Monitor page changes
Detect changes in specific element
Execute script sequences with conditional logic.
Usage:
Execute multiple scripts sequentially
Execute different scripts based on conditions
Create and register custom JavaScript functions.
Usage:
Create custom function
Register function for specific task
Analyze page performance metrics and provide optimization suggestions.
Usage:
Analyze page performance
Measure loading time
Provide advanced anti-detection features.
Usage:
Enable stealth mode
Turn on detection prevention
Provide automatic Turnstile solving.
Usage:
Bypass Cloudflare security
Solve Turnstile captcha
Provide intelligent reCAPTCHA v3 bypass.
Usage:
Bypass reCAPTCHA
Automatically solve captcha
Simulate realistic user patterns.
Usage:
Simulate human-like behavior
Set natural mouse movements
Randomize browser fingerprint.
Usage:
Randomize browser fingerprint
Generate new browser ID
Provide common challenge solving.
Usage:
Handle bot challenges
Pass security checks
Provide comprehensive evasion techniques.
Usage:
Evade detection
Bypass bot detection
Provide real-time security analysis.
Usage:
Monitor security status
Analyze security systems
Provide dynamic IP address changing.
Usage:
Rotate proxies
Change IP address
Randomize user agent.
Usage:
Change user agent
Disguise as different browser
Manipulate request headers.
Usage:
Change HTTP headers
Spoof browser headers
Apply human-like timing patterns.
Usage:
Randomize click timing
Apply natural delay times
Provide comprehensive traffic analysis.
Usage:
Monitor network traffic
Track all HTTP requests
Provide real-time request modification.
Usage:
Intercept API requests
Modify specific URL requests
Provide automatic API capture.
Usage:
Extract API response data
Capture JSON responses
Provide dynamic header injection.
Usage:
Modify request headers
Add Authorization header
Provide resource blocking for performance.
Usage:
Block ad resources
Block image loading
Simulate throttling and latency.
Usage:
Simulate slow network
Set to mobile network speed
Provide detailed activity reports.
Usage:
Show network logs
Check failed requests
Provide WebSocket connection tracking.
Usage:
Monitor WebSocket connections
Track real-time data
Analyze page performance metrics.
Usage:
Analyze page loading performance
Measure network performance
Control browser cache.
Usage:
Clear browser cache
Disable cache
Provide advanced file upload handling.
Usage:
Upload document.pdf to file selection button
Upload image to upload area
Provide controlled downloads with progress indication.
Usage:
Download linked file
Download PDF document
Provide structured data extraction.
Usage:
Extract table data
Extract list items
Extract product information in structured format
Provide multi-format data export.
Usage:
Export data as CSV
Save data in JSON format
Provide configuration import/export.
Usage:
Import browser settings
Export current settings
Provide session state management.
Usage:
Save current session
Restore previous session
Provide complete state backup.
Usage:
Backup browser state
Save cookies and sessions
Provide state restoration.
Usage:
Restore browser state
Load backed up session
Start browser →
Navigate to Google →
Search for "Python" →
Click first result
If login button is visible, click it
If error message appears, take screenshot
Extract all links from page
Export table data as CSV
Start browser in headless mode
Load page with images and ads blocked
Load page while monitoring network traffic
Enable stealth mode and access site
Click button with natural mouse movement
Automatically bypass Cloudflare security
1. Navigate to shopping site
2. Search for products
3. Compare prices
4. Add to cart
5. Automate checkout process
1. Access social media platform
2. Search keywords
3. Collect post data
4. Extract data for sentiment analysis
1. Enable stealth mode
2. Access target site
3. Bypass security
4. Extract data
5. Save in structured format
1. Execute test scenarios
2. Auto-fill forms
3. Verify results
4. Generate screenshots
5. Create test reports
- [🎯 Basic Usage](Basic-Usage) - Learn basic commands
- [🛡️ Security Bypass](Security-Bypass) - Captcha and security bypass
- [🌐 Network Control](Network-Control) - Network features
- [📊 Data Extraction](Data-Extraction) - Data collection methods
- [📜 Script Examples](Script-Examples) - Real use cases
- [⚙️ Advanced Configuration](Advanced-Configuration) - Performance optimization
- [🔍 Troubleshooting](Troubleshooting) - Common problem solving
Automate anything with 75+ powerful tools! 🚀
Basic Usage |
Advanced Configuration |
View Examples