ScrImmo: A Real-time Web Scraper Monitoring the Belgian Real Estate Market

Research output: Contribution in Book/Catalog/Report/Conference proceeding › Conference contribution

Abstract

Web scraping (or Web crawling), a technique for automated data extraction from websites, has emerged as a valuable tool for scientific research and data analysis. This paper presents a comprehensive exploration of Web scraping, its methodologies and challenges. The discussion revolves around a concrete application, namely the automatic extraction of data concerning the Belgian real estate market. We introduce a real-time Web scraper called ScrImmo, tailored to collect data from websites containing real estate classified ads. The tool is developed in a continuous iterative process and is based on an innovative cloud architecture. The paper also briefly addresses the ethical aspects of Web scraping. By integrating insights from previous research and ethical guidelines, this study provides researchers with a comprehensive understanding of Web scraping and its potential benefits, while promoting responsible and ethical practices in data collection and analysis.
Original language: English
Title of host publication: Proceedings - 2023 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2023
Pages: 335-338
Number of pages: 4
ISBN (Electronic): 9798350309188
Publication status: Published - Oct 2023
Event: The 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology - Venice, Italy
Duration: 26 Oct 2023 to 29 Oct 2023
https://www.wi-iat.com/wi-iat2023/index.html

Publication series

Name: Proceedings - 2023 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2023

Conference

Conference: The 22nd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology
Abbreviated title: WI-IAT 2023
Country/Territory: Italy
City: Venice
Period: 26/10/23 to 29/10/23

Keywords

  • Data analysis
  • Data extraction
  • Data gathering
  • Web crawling
  • Web scraping
