  • PSRTI | Completed Research Studies



Web Scraping for Price Statistics in the Philippines

Year: 2022
Authors: Manuel Leonard F. Albis, Sabrina O. Romasoc, Bea Andrea C. Gavira, Shushimita G. Pelayo & Jazzen Paul J. Asombrado

Abstract:

Official price statistics in the Philippines are sourced mainly from regular surveys and censuses, which are costly to conduct. As businesses move onto digital platforms, alternatives to these traditional data sources have become more available. One of these is web scraping, the process of collecting information from the web. As online platforms are increasingly used for commerce, web scraping offers a way to collect price data more frequently and at lower cost than price surveys. This paper aims to compute an online Consumer Price Index (CPI) for the National Capital Region (NCR), specifically for Divisions 1 and 2 of the Philippine Classification of Individual Consumption According to Purpose (PCOICOP), and to compare it with the official NCR CPI calculated by the Philippine Statistics Authority (PSA). In addition to the official CPI methodology, the study introduces a hybrid approach for computing the online CPI. Finally, the paper presents the results of the official run of the developed web scraping programs and offers recommendations for future web scraping projects in the Philippines.
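To make the idea of an index built from scraped prices concrete, the following is a minimal sketch of a weighted Laspeyres-type index, computed as a weighted average of price relatives against a base period. It is not taken from the paper and does not represent the hybrid approach mentioned in the abstract. The item names, prices, weights, and the function name `laspeyres_index` are all hypothetical, chosen only for illustration.

```python
def laspeyres_index(base_prices, current_prices, weights):
    """Weighted average of price relatives, scaled so the base period equals 100.

    All three arguments are dicts keyed by item name. The weights need not
    sum to 1; they are normalized by their total.
    """
    total_weight = sum(weights.values())
    weighted_relatives = sum(
        weights[item] * current_prices[item] / base_prices[item]
        for item in weights
    )
    return 100.0 * weighted_relatives / total_weight


# Hypothetical scraped prices for two food items (illustrative values only).
base = {"rice": 50.0, "bread": 40.0}
current = {"rice": 55.0, "bread": 40.0}
weights = {"rice": 0.6, "bread": 0.4}  # assumed expenditure shares

index = laspeyres_index(base, current, weights)
print(round(index, 2))
```

In this toy example, rice rises by 10% and bread is unchanged, so the index moves above 100 in proportion to the weight assumed for rice.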

Keywords: web scraping, online prices, CPI, PCOICOP, R, RSelenium, rvest
