The HTML scraper template turns a reviewed URL list into a structured CSV of page metadata and full document source. It is built for teams that need to scrape HTML, inspect page titles and meta tags, or export rendered source code without writing a crawler or sending every page through a cloud scraping queue.
Export: CSV · Fields: 5 · Input: URL list · Workflow: 5 blocks · Free import
At a glance
This template is a generic HTML source scraper. It does not depend on one marketplace, directory, or search engine. Replace the sample URLs with your own reviewed pages and collect original_url, title, keywords, description, and source_code in one spreadsheet-friendly export.
Who uses it
SEO and content teams
Metadata audits
Export titles, meta keywords, and descriptions from a controlled set of pages before migrations, campaign launches, or competitor page reviews.
Data engineers
Parser prototyping
Capture real page source into CSV so downstream code can be tested against stable examples before a production crawler is approved.
Compliance reviewers
Evidence snapshots
Keep original URLs beside the rendered document source so review teams can trace where a page-level record came from.
How to use
Download and import
Download the hosted HTML scraper JSON, then import it into the UScraper local desktop app.
Replace the URL list
Open the Navigate block and swap the sample URLs for the pages you are allowed to process.
Confirm the export path
The Structured Export block writes html-scraper.csv with headers and append mode enabled.
Run the workflow
UScraper navigates, waits for load, checks the html node, exports the mapped fields, and loops to the next URL.
Open and QA the CSV
Review redirects, blank metadata, oversized source fields, and HTTP error pages before sending the file into Excel, Sheets, BI tools, or parsers.
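The QA pass described in the last step can be partly scripted. The sketch below is a minimal example, assuming the export uses the five column names listed on this page; the size threshold and the error-title hints are hypothetical values chosen for illustration, not UScraper settings. It does not detect redirects, which still need a manual comparison of original_url against your input list.

```python
import csv
import sys

# Rendered HTML can exceed the csv module's default per-field limit
# (131072 characters), so raise it before reading source_code.
csv.field_size_limit(min(sys.maxsize, 2**31 - 1))

MAX_SOURCE_CHARS = 2_000_000  # hypothetical threshold for "oversized"
ERROR_TITLE_HINTS = ("request error", "404", "not found", "access denied")  # hypothetical


def qa_rows(handle):
    """Yield (record_number, original_url, issues) for records that need review."""
    reader = csv.DictReader(handle)
    for record_number, row in enumerate(reader, start=1):
        issues = []
        # Blank metadata cells are reported rather than silently skipped.
        for column in ("title", "keywords", "description"):
            if not (row.get(column) or "").strip():
                issues.append(f"blank {column}")
        if len(row.get("source_code") or "") > MAX_SOURCE_CHARS:
            issues.append("oversized source_code")
        title = (row.get("title") or "").lower()
        if any(hint in title for hint in ERROR_TITLE_HINTS):
            issues.append("possible error page")
        if issues:
            yield record_number, row.get("original_url", ""), issues
```

To run it against an export, open the file with `open("html-scraper.csv", newline="", encoding="utf-8")` and loop over `qa_rows(handle)`. The encoding is an assumption; adjust it if your export differs.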
Output preview
html-scraper.csv

| Column | Description |
|---|---|
| original_url | Configured URL matched back to the current browser location. |
| title | The page title from document.title. |
| keywords | Content from the meta keywords tag when present. |
| description | Meta description, falling back to og:description. |
| source_code | Full document outer HTML from the rendered page. |
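For parser prototyping, the metadata columns can be re-derived offline from a captured source_code value. The sketch below is one possible approach using Python's standard html.parser. It is not how the template itself extracts fields, and the class and function names are hypothetical. It reads the first title element, so it can differ from document.title on pages where scripts change the title after load.

```python
from html.parser import HTMLParser


class MetadataParser(HTMLParser):
    """Collect the first <title> text and the first value of each <meta> key."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.title_parts = []
        self.meta = {}
        self._in_title = False
        self._title_done = False

    def handle_starttag(self, tag, attrs):
        if tag == "title" and not self._title_done:
            self._in_title = True
        elif tag == "meta":
            attributes = {name: (value or "") for name, value in attrs}
            # Standard tags use name="...", Open Graph tags use property="...".
            key = (attributes.get("name") or attributes.get("property") or "").lower()
            if key and key not in self.meta:
                self.meta[key] = attributes.get("content", "").strip()

    def handle_endtag(self, tag):
        if tag == "title" and self._in_title:
            self._in_title = False
            self._title_done = True  # ignore later titles, e.g. inside inline SVG

    def handle_data(self, data):
        if self._in_title:
            self.title_parts.append(data)


def extract_metadata(source_code):
    """Return title, keywords, and description with an og:description fallback."""
    parser = MetadataParser()
    parser.feed(source_code)
    parser.close()
    meta = parser.meta
    return {
        "title": "".join(parser.title_parts).strip(),
        "keywords": meta.get("keywords", ""),
        # Same fallback order as the description column: meta first, then Open Graph.
        "description": meta.get("description") or meta.get("og:description", ""),
    }
```

Comparing this output with the exported title, keywords, and description columns is a quick way to spot pages where the rendered values and the static markup disagree.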
Sample rows
3 of many
| original_url | title | keywords | description | source_code |
|---|---|---|---|---|
|  | Tweet Scraper - Apify | twitter scraper, tweets, social data | Extract public tweet and profile data for research workflows. | <html lang="en">...</html> |
|  | Intercom API |  | API identity and authorization endpoint response page. | <html>...status or redirect content...</html> |
|  | Request Error |  |  | <html>...error page source...</html> |
Pair this raw-source workflow with the Universal Content Scraper when you need cleaned text, the Contact Details Scraper for emails and phone numbers, and the Google SERP Scraper when search results are the source of your URL list. You can browse more workflows in the UScraper template library.
| This UScraper template (Local) | Hosted scraper services (Cloud) |
|---|---|
| Runs in your local browser session. Useful when URLs and source code should stay in your environment. | Runs on vendor workers. Convenient for scale, but data crosses another platform. |
| Append-ready CSV on disk. Open immediately in spreadsheet tools or feed into parsers. | Dashboard or API export. Often tied to accounts, credits, and remote storage. |
| Free template import. Use with your UScraper plan. | Usage-based billing. Large HTML payloads may consume credits quickly. |
Public HTML can still be covered by site terms, robots rules, copyright, database rights, privacy laws, and anti-abuse controls. Use modest request volume, avoid bypassing access controls, keep source context with each row, and get legal advice before redistributing or reselling captured source code.
Before you run
Guardrails for reliable HTML source exports
Start with short, reviewed batches
Large HTML pages can be slow to load and heavy to store. Test ten URLs first, then expand only after the CSV rows and source sizes look reasonable.
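A pilot batch like this can be prepared before anything is pasted into the Navigate block. The sketch below is a minimal example with a hypothetical helper name: it trims whitespace, drops non-HTTP entries and exact duplicates, and splits the reviewed list into a first batch of ten and a remainder.

```python
from urllib.parse import urlsplit

PILOT_SIZE = 10  # matches the "test ten URLs first" guidance


def split_pilot_batch(urls, pilot_size=PILOT_SIZE):
    """Return (pilot, remainder) from a reviewed URL list, preserving order."""
    seen = set()
    cleaned = []
    for raw in urls:
        url = raw.strip()
        try:
            parts = urlsplit(url)
        except ValueError:
            continue  # malformed entry, leave it for manual review
        if parts.scheme not in ("http", "https") or not parts.netloc:
            continue
        if url in seen:
            continue  # exact duplicate after trimming
        seen.add(url)
        cleaned.append(url)
    return cleaned[:pilot_size], cleaned[pilot_size:]
```

Run the pilot list first, check the resulting rows and source sizes, and only then move on to the remainder.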
Metadata can move or disappear
Titles are usually present, but meta keywords are often absent and descriptions may come from Open Graph tags instead of the meta description. Keep blank cells visible during QA.
Respect source-site rules
Review each site's terms, robots signals, and access rules before commercial reuse, especially if the HTML contains personal data, copyrighted text, or protected database material.
Download and use this template instantly
UScraper templates are open source. Improve this workflow or contribute a new one to help the community grow.
Available on Windows 10+ and macOS 12+.