varinrdudas1eat/dubizzle-jobs-search-scraper
Dubizzle jobs listings extractor
Dubizzle Jobs Search Scraper
Dubizzle Jobs Search Scraper is a powerful tool for collecting structured job listings from Dubizzle across the UAE. It helps professionals turn scattered job posts into clean, analysis-ready data. Ideal for market intelligence, recruitment research, and employment trend analysis.
Created by Bitbash, built to showcase our approach to Scraping and Automation!
If you are looking for dubizzle-jobs-search-scraper you've just found your team — Let’s Chat. 👆👆
Introduction
This project extracts comprehensive job listing data from Dubizzle job search pages using URLs or advanced filters. It solves the problem of manual job market research by automating large-scale data collection. It is built for recruiters, analysts, researchers, and businesses focused on the UAE employment market.
UAE Job Market Intelligence at Scale
- Collects detailed job listings across all major UAE cities
- Supports both URL-based and filter-based job discovery
- Captures rich metadata for deep employment analysis
- Designed for high accuracy and repeatable research workflows
Features
| Feature | Description |
|---|---|
| Dual Scraping Modes | Scrape jobs using direct URLs or dynamic search filters. |
| Rich Job Metadata | Extracts over 40 structured fields per job listing. |
| Location Intelligence | Captures emirate, city, and neighborhood data. |
| Smart Filtering | Filter jobs by keyword, salary, experience, type, and remote work. |
| Scalable Collection | Handles multiple categories and large result sets efficiently. |
What Data This Scraper Extracts
| Field Name | Field Description |
|---|---|
| id | Unique internal job identifier. |
| uuid | Universal unique identifier for the job listing. |
| name | Job title in multiple languages. |
| added | Timestamp when the job was posted. |
| absolute_url | Direct link to the job listing. |
| category | Job category and classification details. |
| site | Emirate or regional site identifier. |
| location_list | Country, city, and neighborhood hierarchy. |
| employment_type | Full-time, part-time, contract, or temporary. |
| salary | Declared salary range when available. |
| experience | Required work experience level. |
| company_name | Employer name when disclosed. |
| benefits | Job benefits such as visa or accommodation. |
| description | Full job description and requirements. |
Example Output
[
{
"id": 77329,
"uuid": "aa547fa355cf46969fc84a471e4e9b52",
"name": {
"en": "Massage Therapists"
},
"employment_type": "Full Time",
"salary": "Less than 2,000",
"location": "Sharjah - Al Darari",
"company_name": "Confidential",
"absolute_url": "https://sharjah.dubizzle.com/jobs/beauty-salon/..."
}
]
Directory Structure Tree
Dubizzle Jobs Search Scraper/
├── src/
│ ├── main.py
│ ├── scraper/
│ │ ├── jobs_parser.py
│ │ ├── filters.py
│ │ └── pagination.py
│ ├── utils/
│ │ ├── http_client.py
│ │ └── normalizer.py
│ └── config/
│ └── settings.example.json
├── data/
│ ├── sample_input.json
│ └── sample_output.json
├── requirements.txt
└── README.md
Use Cases
- Recruitment agencies use it to track hiring demand and salary trends, enabling better candidate placement.
- Market researchers analyze employment patterns across UAE cities to identify growth sectors.
- Business analysts monitor competitor hiring activity to inform strategic planning.
- Job aggregators expand their listings database with structured, up-to-date opportunities.
- Academic researchers study workforce distribution and labor market dynamics.
FAQs
Q: Can I scrape multiple job categories at once?
Yes, you can provide multiple category or search URLs, or broaden filters to cover several job types in a single run.
Q: Does it support all UAE cities?
Yes, it supports nationwide searches as well as specific emirates such as Dubai, Abu Dhabi, Sharjah, and others.
Q: How do I avoid duplicate job listings?
Each job includes a unique ID and UUID, which can be used for reliable deduplication during post-processing.
Q: Is the output suitable for analytics tools?
Absolutely. The structured JSON format is designed for direct use in dashboards, spreadsheets, and BI pipelines.
Performance Benchmarks and Results
Primary Metric: Processes up to 20 job listings per page with consistent extraction accuracy.
Reliability Metric: Maintains a high success rate through retry logic and resilient request handling.
Efficiency Metric: Optimized pagination and filtering reduce unnecessary requests and data overhead.
Quality Metric: Delivers high data completeness with detailed fields across job, company, and location attributes.
