π Examples of browser automation
Lets see how we can use the power of Playwright MCP Server to automate our browser and do webscrapping
Using Different Browser Typesβ
Playwright MCP now supports multiple browser engines. You can choose between Chromium (default), Firefox, and WebKit:
Given I navigate to website "https://example.com" using the "firefox" browser
And I take a screenshot named "firefox-example"
Then I navigate to website "https://example.com" using the "webkit" browser
And I take a screenshot named "webkit-example"
When you send these commands to Claude, it will open the website in Firefox first, take a screenshot, then switch to WebKit and take another screenshot, allowing you to compare how different browsers render the same website.
Scenario in BDD Formatβ
Given I navigate to website http://eaapp.somee.com and click login link
And I enter username and password as "admin" and "password" respectively and perform login
Then click the Employee List page
And click "Create New" button and enter realistic employee details to create for Name, Salary, DurationWorked,
Select dropdown for Grade as CLevel and Email.
Once I enter the above text in Claude Desktop Client I should see Claude desktop giving me prompt to perform operation by opening real browser like this
And once the entire test operation completes, we will be presented with the entire details of how the automation did happened.
Using Browser History Navigationβ
You can navigate through the browser's history using the new navigation controls:
Given I navigate to website "https://example.com"
When I navigate to website "https://example.com/about"
And I navigate back in browser history
Then the current page should be "https://example.com"
When I navigate forward in browser history
Then the current page should be "https://example.com/about"
Using Drag and Drop Functionalityβ
You can drag and drop elements using the new drag tool:
Given I navigate to website "https://example.com/drag-drop-demo"
When I drag element with id "draggable" to element with id "droppable"
Then I should see confirmation message "Dropped!"
Using Keyboard Interactionsβ
You can simulate keyboard presses with the new keyboard tool:
Given I navigate to website "https://example.com/form"
When I focus on the input field with id "search-box"
And I press the "Enter" key
Then the search results should appear
Saving Page as PDFβ
You can save the current page as a PDF file:
Given I navigate to website "https://example.com/report"
When I save the current page as a PDF in "/downloads" folder with name "report.pdf"
Then I should see confirmation that the PDF was saved
Advanced example with custom options:
Given I navigate to website "https://example.com/invoice"
When I save the page as PDF with the following settings:
| Setting | Value |
| ----------------- | --------- |
| Output Path | /downloads |
| Filename | invoice.pdf |
| Format | Letter |
| Print Background | true |
| Top Margin | 2cm |
| Right Margin | 1cm |
| Bottom Margin | 2cm |
| Left Margin | 1cm |
Then I should see confirmation that the PDF was saved
Extracting Page Contentβ
You can extract visible text content from the page:
Given I navigate to website "https://example.com/article"
When I extract all visible text from the page
Then I should see the article content in plain text without hidden elements
You can also get the complete HTML of the page:
Given I navigate to website "https://example.com/products"
When I extract the HTML content of the page
Then I should receive the complete HTML structure of the page
Example use case for content analysis:
Given I navigate to website "https://example.com/pricing"
When I extract all visible text from the page
Then I should be able to analyze the text to find pricing information
And I can determine if the "Enterprise" plan mentions "custom pricing"