ercedes-owe/myDealz-Scraper
mydealz deals data extractor
myDealz Scraper
This scraper pulls detailed deal information from myDealz, Germany’s biggest community-driven deal platform. It lets you extract structured data from any deal category or filtered listing page, giving you clean insights into pricing trends, hot deals, and user-driven promotions. If you're building a comparison tool or monitoring competitive pricing, this scraper delivers reliable results quickly.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
If you are looking for myDealz Scraper you've just found your team — Let's Chat. 👆👆
Introduction
The myDealz Scraper collects deal-level information from pages you specify—whether it’s a category, a filtered results page, or a custom deal stream. It’s built for analysts, price-tracking tools, e-commerce teams, and hobby deal hunters who need structured, up-to-date deal data without manual browsing.
Why It’s Helpful
- Extracts complete deal metadata directly from myDealz listings.
- Works with any deal group or category page, including filtered URLs.
- Provides consistent, structured JSON ready for analysis or automation.
- Helps track price drops, community heat scores, merchants, and more.
- Fits perfectly into price comparison engines or trend dashboards.
Features
| Feature | Description |
|---|---|
| Category & Filter Support | Scrapes from any myDealz list page—including filtered or sorted views. |
| Rich Deal Extraction | Captures title, price, discount, heat, merchant, timestamps, and more. |
| Pagination Handling | Follows listing pages automatically for larger result sets. |
| Structured Output | Outputs consistent JSON objects suitable for automated workflows. |
| Flexible Input | Accepts any listing URL via startUrls. |
| Reliable Crawling | Uses fast HTML parsing for stable extraction. |
What Data This Scraper Extracts
| Field Name | Field Description |
|---|---|
| title | Deal headline. |
| url | Direct link to the deal page. |
| price | Listed deal price. |
| originalPrice | Original or crossed-out price if available. |
| discount | Calculated or displayed discount value. |
| heat | Community heat score. |
| merchant | Seller or store associated with the deal. |
| category | Deal category or group. |
| postedAt | Timestamp when the deal was posted. |
| author | Username of the posting member. |
| image | Thumbnail image URL. |
| description | Short deal description from the listing page. |
Example Output
[
{
"title": "Samsung 4K Smart TV 55 Zoll",
"url": "https://www.mydealz.de/deals/samsung-4k-55-123456",
"price": 399,
"originalPrice": 599,
"discount": "33%",
"heat": 842,
"merchant": "MediaMarkt",
"category": "Elektronik",
"postedAt": "2024-03-11T08:22:00Z",
"author": "DealHunterDE",
"image": "https://images.mydealz.de/deal_123456.jpg",
"description": "Guter Preis für den beliebten 55'' Samsung 4K Fernseher."
}
]
Directory Structure Tree
myDealz Scraper/
├── src/
│ ├── main.js
│ ├── scraper/
│ │ ├── list_parser.js
│ │ ├── pagination_handler.js
│ │ └── deal_normalizer.js
│ ├── utils/
│ │ ├── fetch.js
│ │ └── logger.js
│ └── config/
│ └── settings.example.json
├── data/
│ ├── sample_input.json
│ └── sample_output.json
├── package.json
└── README.md
Use Cases
- Price comparison tools enrich their databases with real deal pricing and discount trends.
- E-commerce analysts track competitor promotions and seasonal deal patterns.
- Deal-tracking apps fetch fresh myDealz entries for alerts and feeds.
- Market researchers analyze customer voting behavior and heat scores.
- Retail teams monitor marketplace sentiment around their brand and offers.
FAQs
Do I need a specific type of URL?
Any myDealz listing page works—categories, filtered lists, sorted pages, etc.
Does it scrape individual deal pages?
It extracts all fields available from listing pages; deep scraping can be added if needed.
Can it handle many pages?
Yes, it paginates through results automatically.
Is the output structured?
All results are returned as consistent JSON objects.
Performance Benchmarks and Results
Primary Metric:
Parses dozens of deals per second using lightweight HTML extraction.
Reliability Metric:
Maintains a stable success rate above 97% even on long paginated lists.
Efficiency Metric:
Optimized pagination limits redundant requests and accelerates throughput.
Quality Metric:
Produces clean, accurate deal records ready for price tracking or analytics.
