Media monitoring teams
Daily clips
Collect article URLs, sources, dates, and body text into one CSV before tagging coverage by topic, region, company, or executive mention.
Limited Time — Lifetime Access for just $99. Lock in before prices rise.
A Yahoo News scraper for turning live article pages into a clean CSV. The template starts at Yahoo News, discovers current article links from listing, ranking, topic, and flash pages, then exports page address, title, date, news source, image URL, and article body text into yahoo-news-scraper.csv from the local desktop app.
CSV file
6
6 blocks
Live pages
Free template
At a glance
Article pages become spreadsheet rows
The workflow discovers live news.yahoo.co.jp/articles/ URLs, opens the article HTML, and writes a structured row for each usable story. That makes Yahoo News to CSV practical for analysts who need a repeatable file instead of copied browser text.
Body text is included when available
Many news scrapers stop at the headline. This template looks for structured article data first, then falls back to readable paragraph text so the CSV can support keyword review, summaries, or downstream classification.
Runs under your control
The browser session and export file live on your machine. That is useful for teams that cannot approve hosted news actors, proxy dashboards, or vendor-side retention of sensitive monitoring topics.
Built around live discovery
The template avoids stale sample URLs by discovering current links from Yahoo News entry points at run time. Deleted pages are skipped where possible, and empty runs surface a plain diagnostic row.
Who this is for
Media monitoring teams
Daily clips
Collect article URLs, sources, dates, and body text into one CSV before tagging coverage by topic, region, company, or executive mention.
Researchers and analysts
Text datasets
Build a small, traceable corpus for summaries, language review, source comparison, or classification experiments while preserving the original page address for audit.
Newsroom operations
Source checks
Export Yahoo News articles for internal review, then pair the row set with the Google News Keyword Scraper or Yahoo Search Results Scraper when a story needs broader search context.
How to use
Download and import
Use the page CTA to download the hosted JSON template, then import it into UScraper.
Open the live source
The Navigate block starts at https://news.yahoo.co.jp/, waits for the page to load, and confirms the document body is available.
Discover article URLs
The injected JavaScript scans the current page and a few Yahoo News listing surfaces for live article links, then limits the first batch so you can validate output before increasing scale.
Fetch and normalize fields
For each article URL, the workflow reads structured data, meta tags, headings, time elements, publisher fields, images, and article paragraphs in that order.
Export the CSV
Structured Export writes yahoo-news-scraper.csv with headers into the configured save folder.
Output preview
yahoo-news-scraper.csvColumn
page_address
Canonical article URL collected from live Yahoo News links.
Column
title
Cleaned article headline without the Yahoo News suffix.
Column
date
Published or modified date when Yahoo exposes it.
Column
news_source
Publisher or author/source name from structured data or metadata.
Column
image_url
Primary article image URL when available and not a default placeholder.
Column
text
Article body text from JSON-LD or readable paragraphs.
Sample rows
2 of many
| page_address | title | date | news_source | image_url | text |
|---|---|---|---|---|---|
| Cabinet approves new relief package after storm damage | 2026-06-10T09:15:00+09:00 | Daily News Wire | Officials said the package will prioritize housing repairs and regional infrastructure... | ||
| Markets watch export data as regional demand shifts | 2026-06-10 | Business Desk | Analysts pointed to currency pressure and inventory adjustments across manufacturers... |
For adjacent workflows, use the Google News Keyword Scraper for keyword feeds, the Yahoo Search Results Scraper for broader SERP coverage, and the DuckDuckGo Search Results Scraper when you need a second search surface. Browse the full UScraper template library or install the app from the download page.
Automating Yahoo News can conflict with Yahoo terms, robots directives, publisher rights, copyright, privacy rules, or local scraping law even when article pages are public. Keep volume modest, avoid paywalls or login walls, do not republish article bodies without permission, and consult counsel before commercial datasets.
Limits
Check these before production runs
Yahoo can redesign listing and article pages
The template prefers structured data and metadata before falling back to paragraphs, but major page changes can still reduce row quality. Validate output after each noticeable Yahoo News redesign.
Some discovered URLs may disappear
News pages expire, redirect, or return deleted-page messages. The workflow skips obvious missing pages and emits a diagnostic row if no live articles are found.
Article text is copyrighted content
Use exports for permitted internal review, research, or analysis. Do not republish, resell, or train commercial systems on publisher text unless your legal basis and platform permissions are clear.
Download and use this template instantly
UScraper templates are open source. Improve this workflow or contribute a new one to help the community grow.
Contribute on GitHubBrowse more templates in the library
All TemplatesHere are some of our most common questions. Can't find what you're looking for?
View All FAQsDownload UScraper and build your first web scraper in under 10 minutes. No subscriptions, no code, no limits.
Available on Windows 10+ and macOS 12+ · Need help? [email protected]