This Google News scraper turns a list of Google News result or article URLs into a structured CSV. Import the template into the UScraper local desktop app, replace the sample URLs, and export source, title, canonical URL, publish date, author, keywords, article text, video, and image fields without building a news API integration.
At a glance
This template is built for teams that already have article URLs from Google News, alerts, research queues, or a separate headline scraper. It does not crawl the whole Google News site. Instead, it follows a controlled URL list, waits for each destination page, and exports the best available metadata from the publisher page itself.
That makes it useful when a headline feed is only the first step. PR teams can archive coverage with canonical links and dates, analysts can collect article bodies for review, and SEO researchers can compare how different publishers describe the same event. The export also includes an ErrorMessage column, so direct image URLs, blocked pages, or thin pages are visible instead of silently disappearing.
Article rows ready for spreadsheets
Export Google News article URLs into a CSV that includes identity fields, metadata, text, media links, and a diagnostic status.
One row per supplied URL
The Navigate block holds the article list, and Loop Continue advances through each input while Structured Export appends rows.
Local desktop workflow
The browser session runs on your machine and the CSV lands in the folder configured inside the export block.
Best-effort publisher parsing
The template reads Open Graph, Twitter card, canonical link, time tags, JSON-LD, article text, and image sources where publishers expose them.
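For readers who want to see what best-effort publisher parsing can look like, here is a minimal Python sketch using only the standard library. It is an illustration under assumptions, not the template's actual logic: the class and function names are hypothetical, the H1 and document-title fallbacks are left out for brevity, and the order in which date sources are tried is a guess.

```python
from html.parser import HTMLParser
import json


class MetadataParser(HTMLParser):
    """Collects a few common metadata signals from a publisher page."""

    def __init__(self):
        super().__init__()
        self.meta = {}          # og:*, twitter:*, and name=... values
        self.canonical = None   # href of <link rel="canonical">
        self.time_values = []   # datetime attributes of <time> tags
        self.json_ld = []       # parsed JSON-LD objects
        self._in_json_ld = False
        self._buffer = []

    def handle_starttag(self, tag, attrs):
        a = dict(attrs)
        if tag == "meta":
            key = a.get("property") or a.get("name")
            if key and a.get("content"):
                # Keep the first value seen for each key.
                self.meta.setdefault(key.lower(), a["content"])
        elif tag == "link" and "canonical" in (a.get("rel") or "").lower().split():
            self.canonical = a.get("href")
        elif tag == "time" and a.get("datetime"):
            self.time_values.append(a["datetime"])
        elif tag == "script" and (a.get("type") or "").lower() == "application/ld+json":
            self._in_json_ld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_json_ld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_json_ld:
            self._in_json_ld = False
            try:
                self.json_ld.append(json.loads("".join(self._buffer)))
            except json.JSONDecodeError:
                pass  # malformed JSON-LD is skipped, not fatal


def extract_metadata(html, current_url):
    """Return a partial row; missing signals become empty strings."""
    parser = MetadataParser()
    parser.feed(html)
    ld = next((obj for obj in parser.json_ld if isinstance(obj, dict)), {})
    first_time = parser.time_values[0] if parser.time_values else ""
    return {
        "Title": parser.meta.get("og:title") or parser.meta.get("twitter:title") or "",
        "Title_URL": parser.canonical or current_url,
        "PublishDate": (parser.meta.get("article:published_time")
                        or first_time
                        or ld.get("datePublished", "")),
    }
```

Because every lookup falls back to an empty string, a page that exposes none of these signals still yields a row, which matches the "where publishers expose them" caveat above.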
Who this is for
PR and communications teams
Coverage archive
Store source, title, date, abstract, canonical URL, and body text for coverage summaries without pasting each article into a spreadsheet by hand.
Analysts and researchers
Thematic review
Compare articles across publishers, keep original URLs beside extracted text, and flag pages where the article body was missing or blocked.
SEO and content teams
Publisher discovery
Pair this template with the Google News Keyword Scraper when you need both headline feeds and deeper article exports.
How to use
Download and import
Download the hosted Google News scraper JSON template, then import it into UScraper.
Replace the sample URLs
Open the Navigate block and paste your approved Google News result URLs or publisher article URLs. Keep the first test batch small.
Confirm the export path
Structured Export writes google-news-scraper-cloud.csv with a header row, in append mode. Change the save folder before recurring runs.
Run the browser flow
UScraper navigates to a URL, waits for page load, waits for body, sleeps briefly, and then exports the row.
Open and review the CSV
Check article text, media URLs, and the ErrorMessage column before sharing the dataset downstream.
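If you prefer to script the final review step, the check can be sketched in a few lines of Python. This is an optional helper, not part of the template: the function name is hypothetical, and it assumes the CSV is UTF-8 and uses the column names listed in the output preview.

```python
import csv


def rows_needing_review(csv_path):
    """Return (row_number, start_url, reason) for rows that need manual QA."""
    flagged = []
    with open(csv_path, newline="", encoding="utf-8") as f:
        # Row 1 is the header, so data rows are numbered from 2.
        for number, row in enumerate(csv.DictReader(f), start=2):
            error = (row.get("ErrorMessage") or "").strip()
            text = (row.get("Text") or "").strip()
            if error:
                flagged.append((number, row.get("Start_URL", ""), error))
            elif not text:
                flagged.append((number, row.get("Start_URL", ""), "empty Text"))
    return flagged
```

Calling `rows_needing_review("google-news-scraper-cloud.csv")` returns an empty list when every row has article text and no diagnostic message.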
Automation path inside the template
Navigate
Visit each URL from the configured input list.
Wait and settle
Wait for load, confirm the page body exists, and pause briefly for late metadata or images.
Structured export
Append the article metadata, text, media fields, and diagnostic status into one CSV.
Loop continue
Move to the next URL until the list is complete.
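The four blocks above amount to a simple loop. The Python sketch below mirrors that shape so the control flow is easy to reason about. It does not reproduce UScraper's internals: `run_url_list` and its `extract_row` callback are hypothetical stand-ins for the Navigate and export blocks, and the sleep is only a placeholder for the wait-and-settle step.

```python
import csv
import os
import time

COLUMNS = ["Search_word", "Source", "Title", "Title_URL", "PublishDate",
           "Author", "Start_URL", "Keywords", "Abstract", "Text", "Video",
           "First_image", "Images", "Text_by_XPath", "ErrorMessage"]


def run_url_list(urls, extract_row, csv_path, settle_seconds=0.0):
    """Visit each URL in order and append one row per URL to csv_path."""
    # Write the header only when the file is new or empty (append mode).
    write_header = not os.path.exists(csv_path) or os.path.getsize(csv_path) == 0
    with open(csv_path, "a", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=COLUMNS, restval="",
                                extrasaction="ignore")
        if write_header:
            writer.writeheader()
        for url in urls:                      # loop continue
            row = {"Start_URL": url}
            try:
                time.sleep(settle_seconds)    # placeholder for wait and settle
                row.update(extract_row(url))  # placeholder for navigate + parse
            except Exception as exc:
                # A failed URL still produces a row, with a diagnostic message.
                row["ErrorMessage"] = str(exc)
            writer.writerow(row)              # structured export, one row per URL
```

Recording failures in ErrorMessage instead of skipping the URL keeps the one-row-per-input property, so the output can be reconciled against the input list.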
Output preview
The output is designed for auditability. Title_URL stores the canonical URL when available, Start_URL keeps the URL actually opened, and ErrorMessage explains common failures such as direct image inputs or pages where article text was not detected.
google-news-scraper-cloud.csv

| Column | Description |
|---|---|
| Search_word | Query parameter found in the URL, when present. |
| Source | Publisher or site name from metadata or hostname. |
| Title | Open Graph title, Twitter title, H1, or document title. |
| Title_URL | Canonical link when available, otherwise the current page URL. |
| PublishDate | Published date from article metadata, time tags, or JSON-LD. |
| Author | Author from meta tags or JSON-LD author objects. |
| Start_URL | The exact URL opened by Navigate. |
| Keywords | Meta keywords and article tags joined with semicolons. |
| Abstract | Description metadata or first visible paragraph. |
| Text | Article or main content paragraphs and list items. |
| Video | Open Graph video or visible video source. |
| First_image | Primary image from metadata or page content. |
| Images | Unique metadata and page image URLs joined with semicolons. |
| Text_by_XPath | Flattened article or main text fallback. |
| ErrorMessage | Diagnostic message for non-article URLs or missing text. |

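
A few of these column rules are easy to sketch in Python if you want to post-process or sanity-check the export. The helpers below are hypothetical illustrations, not the template's code. In particular, the query parameter name `q` and the stripping of a leading `www.` are assumptions, since the page does not say which parameter or hostname form the template uses.

```python
from urllib.parse import parse_qs, urlparse


def search_word(url, param="q"):
    """Return the search query found in the URL, or '' when absent.

    The parameter name 'q' is an assumption for illustration.
    """
    values = parse_qs(urlparse(url).query).get(param, [])
    return values[0] if values else ""


def source_name(site_name, url):
    """Prefer the site name from metadata, otherwise fall back to the hostname."""
    if site_name:
        return site_name
    host = urlparse(url).hostname or ""
    return host[4:] if host.startswith("www.") else host


def join_unique(values, sep=";"):
    """Join non-empty values with semicolons, dropping duplicates in order."""
    cleaned = (v.strip() for v in values if v and v.strip())
    return sep.join(dict.fromkeys(cleaned))
```

`join_unique` preserves first-seen order, so the first entry of an Images-style field stays aligned with whatever was discovered first on the page.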
Sample rows
| Search_word | Source | Title | Title_URL | PublishDate | Author | Start_URL | Keywords | Abstract | Text | Video | First_image | Images | Text_by_XPath | ErrorMessage |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| labor policy | Example News | Senate vote expected next week on agency quorum | https://example.com/insights/agency-quorum | 2026-06-03T09:30:00Z | Policy Desk | | labor; senate; agency | Lawmakers are expected to vote next week... | Full article paragraphs joined with blank lines. | | | https://example.com/image.jpg;https://example.com/chart.png | Senate vote expected next week on agency quorum... | |

| Column | Example value | Why it matters |
|---|---|---|
| Source | Example News | Groups rows by publisher. |
| Title_URL | https://example.com/insights/agency-quorum | Keeps the canonical article URL for citation. |
| Text | Full article paragraphs joined with blank lines. | Gives reviewers the article body when access is permitted. |
| Images | https://example.com/image.jpg;https://example.com/chart.png | Preserves media references for coverage review. |
| ErrorMessage | Article text not detected or page may be blocked | Flags rows that need manual QA. |
Pair this article extractor with the Google Search Scraper for SERP research, the Bing Search Results Scraper for cross-engine checks, and the full UScraper template library when collected URLs need enrichment from other sources.
Scraping public news pages can still be limited by Google policies, publisher terms, robots rules, copyright, privacy law, and your internal data policy. Keep runs modest, avoid login walls and paywalls, do not bypass verification challenges, and get legal review before resale, AI training, or republication.
Before you run
Guardrails for reliable news exports
Publisher pages can slow, block, or challenge repeated requests
Keep batches modest, avoid parallel runs, and treat login prompts, paywalls, CAPTCHA screens, and repeated empty text as stop conditions for unattended automation.
Publisher markup changes over time
Empty author, date, image, or article body fields usually mean the page exposes different metadata. Re-test a small batch before relying on recurring exports.
CSV output still needs responsible use
Review Google policies, publisher terms, robots guidance, client contracts, and internal acceptable-use rules before republishing, reselling, or training models on article text.
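One of the stop conditions above, repeated empty text, is simple to express in code if you monitor runs with your own script. The Python sketch below is a hypothetical helper, not a UScraper feature, and the threshold of three consecutive rows is an arbitrary assumption you should tune to your batch size.

```python
def should_stop(recent_rows, max_consecutive_empty=3):
    """Return True when the last N exported rows all lack article text.

    recent_rows is a list of dicts with a 'Text' key, oldest first.
    The default threshold of 3 is an illustrative assumption.
    """
    if len(recent_rows) < max_consecutive_empty:
        return False
    tail = recent_rows[-max_consecutive_empty:]
    return all(not (row.get("Text") or "").strip() for row in tail)
```

A run of empty Text values often means the publisher has started blocking or challenging requests, so pausing for manual review is safer than continuing unattended.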
Install the local desktop app from uscraper.io/download, then use this template whenever you need to export Google News articles into a reviewable local CSV.
UScraper templates are open source. Improve this workflow or contribute a new one to help the community grow.
Available on Windows 10+ and macOS 12+.