Limited Time — Lifetime Access for just $99. Lock in before prices rise.

UScraper
Tutorials

How to Scrape Paginas Amarillas Business Details to CSV

Scrape Paginas Amarillas detail pages to CSV. Extract names, phones, addresses, hours, categories and websites using UScraper local desktop workflow.

UScraper
July 1, 2026
8 min read
#how to scrape paginas amarillas#paginas amarillas scraper tutorial#scrape paginas amarillas business details#paginas amarillas scraper#paginas amarillas business details#best paginas amarillas scraper#octoparse paginas amarillas alternative#paginas amarillas scraper vs apify
How to Scrape Paginas Amarillas Business Details to CSV

This tutorial shows how to scrape Paginas Amarillas business details from approved detail-page URLs into CSV with the Paginas Amarillas Spain product-details template for UScraper. You will import the workflow, add source URLs, set the export path, run a small validation batch, and fix the common issues that show up in directory scraping.

Before you start

Prerequisites and scope

You need UScraper installed, the Paginas Amarillas scraper template imported, a folder for CSV exports, and a short list of business detail URLs you are allowed to process. Start with three to five URLs. A small first run makes it obvious whether names, phones, addresses, hours, and websites are rendering correctly before you add more pages.

This article covers product-detail style pages, not a broad site crawler. The bundled JSON uses a known URL list in the Navigate block, then loops through each page. If you need source URLs first, build them from a manual shortlist, an approved internal list, the official Paginas Amarillas home page, or directory entry points such as the activity index.

The right first run is not "all of Spain". It is five approved URLs, one CSV, and a field-by-field check against the browser.

Review the source site's current terms, robots.txt, privacy duties, and outreach rules before collecting or reusing contact data. Public visibility does not automatically mean unrestricted automated use.


Workflow shape

How the Paginas Amarillas scraper works

The template is intentionally compact: Navigate -> Wait for Page Load -> Wait for Element -> Structured Export -> Loop Continue. Navigate stores the detail URL list. The wait blocks give the page time to render and confirm that an h1 business title exists. Structured Export reads the page and appends the row. Loop Continue moves to the next URL.

The JSON export in the template is the authoritative workflow definition. In practical terms, it says:

{
  "input": "known paginasamarillas.es/f/... detail URLs",
  "waitFor": ["page load", "h1"],
  "outputFile": "paginas-amarillas-detalles-scraper.csv",
  "mode": "append",
  "columns": [
    "nombre",
    "pagina_url",
    "categoria",
    "telefono",
    "ubicacion",
    "horario",
    "informacion_y_servicio",
    "actividades",
    "descripcion",
    "sitio_web",
    "hora_actual"
  ]
}

That shape matters because it avoids guessing through search pagination during the detail export. You decide which business profiles belong in the run, and the scraper focuses on turning those selected pages into consistent CSV rows.

paginas-amarillas-detalles-scraper.csv
CSV - append

Column

nombre

Business name from the page title.

Column

pagina_url

Source detail URL without a hash fragment.

Column

categoria

Visible category or breadcrumb-derived category text.

Column

telefono

Phone number from visible tel links when present.

Column

ubicacion

Address or nearby location block from the profile.

Column

horario

Opening-hours text when the weekly schedule is visible.

Column

sitio_web

External website link exposed by the profile.

Headers included - one row per configured Paginas Amarillas detail URL

Runbook

How to scrape Paginas Amarillas business details to CSV

1

Import the template

Open the Paginas Amarillas Spain product-details template, download the JSON, and import it into UScraper.

2

Replace the sample URLs

In Navigate, replace the two sample Madrid detail pages with approved paginasamarillas.es/f/... URLs. Keep one city or category per first batch so validation stays simple.

3

Confirm the export path

In Structured Export, check the save folder, paginas-amarillas-detalles-scraper.csv, headers, and append mode. Use a new folder for each client, campaign, or test batch.

4

Run one URL first

Disable the rest of the list or keep only one URL, run the workflow, then compare every exported field against the browser page.

5

Scale only after QA

Restore the remaining URLs, run the loop, and inspect rows from the beginning, middle, and end of the CSV before importing the file elsewhere.


Validation

Validate names, phones, addresses, and websites

Do not treat the first CSV as final. Directory pages can omit fields, change labels, delay sections, or show different content by location and consent state. The fastest QA method is to keep the source page open beside the spreadsheet and check one row at a time.

SymptomLikely causeFix
Empty nombreThe page did not reach a visible h1 stateExtend the wait, handle prompts, and rerun one URL.
Blank telefonoNo visible phone link, hidden contact UI, or layout driftConfirm the phone is visible in the browser and update the selector if needed.
Weak categoriaCategory text moved or breadcrumb markup changedCompare several pages in the same category and adjust the extraction rule.
Missing sitio_webThe profile does not expose an external website linkKeep the blank cell and use pagina_url as the traceable source.
Duplicate rowsThe same URL was supplied twice or a batch was rerunDeduplicate by pagina_url before CRM import.

Alternatives

UScraper vs Octoparse, Thunderbit, and Apify

If you are comparing the best Paginas Amarillas scraper for a one-off business-detail export, decide on custody and workflow control first. UScraper is strongest when you want a supervised local desktop workflow, a visible browser session, and a CSV you can validate before it leaves your machine.

Octoparse and Thunderbit are useful when your team already works in their no-code scraping environments. Apify actors are useful when you want cloud execution, APIs, scheduling, or a hosted dataset pipeline. Those trade-offs can be worth it for production automation, but they also change pricing, account setup, and where the data is processed.

OptionBetter fitTrade-off
UScraper templateLocal CSV, visible QA, detail URLs you already selectedYou manage the desktop run and selector maintenance.
Octoparse-style templateTeams already using a hosted no-code scraperProject setup and vendor workflow rules vary.
Apify actorAPI-driven cloud jobs and scheduled datasetsData processing and billing follow the actor platform.
Custom Python scraperEngineers who need full controlYou own parsing, retries, compliance review, and maintenance.

FAQ

Paginas Amarillas scraper tutorial FAQ

Public directory pages can still be governed by site terms, robots directives, anti-abuse systems, privacy law, copyright, and outreach rules. Review the current source policies, use modest pacing, avoid bypassing access controls, and collect only fields you have a lawful reason to process.


Next step

Download the Paginas Amarillas product-details scraper

Use the Paginas Amarillas Spain product-details template as the download path and keep this tutorial open for setup and QA. For related workflows, browse the UScraper template library or read more CSV export guides on the UScraper blog.

FAQ

Frequently asked questions

Here are some of our most common questions. Can't find what you're looking for?

View All FAQs

Stop writing scripts. Start scraping visually.

Download UScraper and build your first web scraper in under 10 minutes. No subscriptions, no code, no limits.

Available on Windows 10+ and macOS 12+ · Need help? [email protected]