Activities per year
Abstract
Web scraping (or Web crawling), a technique for automated data extraction from websites, has emerged as a valuable tool for scientific research and data analysis. This paper presents a comprehensive exploration of Web scraping, its methodologies and challenges.The discussion revolves around a concrete application, namely the automatic extraction of data concerning the Belgian real estate market. We introduce a real-time Web scraper called \scrimmo~and tailored to collect data from websites containing real estate classified ads. The tool is developed in a continuous iterative process and based on an innovative cloud architecture. The paper also briefly addresses the ethical aspects of Web scraping. By integrating insights from previous research and ethical guidelines, this study provides researchers with a comprehensive understanding of Web scraping and its potential benefits, while promoting responsible and ethical practices in data collection and analysis.
Original language | English |
---|---|
Title of host publication | Proceedings - 2023 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2023 |
Pages | 335-338 |
Number of pages | 4 |
ISBN (Electronic) | 9798350309188 |
DOIs | |
Publication status | Published - Oct 2023 |
Event | The 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology - Venise, Italy Duration: 26 Oct 2023 → 29 Oct 2023 https://www.wi-iat.com/wi-iat2023/index.html |
Publication series
Name | Proceedings - 2023 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2023 |
---|
Conference
Conference | The 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology |
---|---|
Abbreviated title | WI-IAT 2023 |
Country/Territory | Italy |
City | Venise |
Period | 26/10/23 → 29/10/23 |
Internet address |
Keywords
- Data analysis
- Data extraction
- Data gathering
- Web crawling
- Web scraping
Fingerprint
Dive into the research topics of 'ScrImmo: A Real-time Web Scraper Monitoring the Belgian Real Estate Market'. Together they form a unique fingerprint.-
The 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology
Yernaux, G. (Participant)
26 Oct 2023 → 29 Oct 2023Activity: Participating in or organising an event types › Participation in conference
-
ScrImmo: A Real-time Web Scraper Monitoring the Belgian Real Estate Market
Yernaux, G. (Speaker)
2023Activity: Talk or presentation types › Oral presentation
Student theses
-
L’extraction de données pour refléter les activités du marché immobilier namurois en temps réel et l’intégrer dans un système de support à la décision pour aider les collectivités locales à prendre des décisions informées
BARZIN, F. (Author), Vanhoof, W. (Supervisor) & Yernaux, G. (Co-Supervisor), 20 Jun 2023Student thesis: Master types › Master in Computer science
File