  • PSRTI | Completed Research Studies

Web Scraping for Price Statistics in the Philippines

Year: 2020
Authors: Manuel Leonard F. Albis, Maegan S. Saroca, Shushimita G. Pelayo & Jessa S. Lopez

Abstract:

Official price statistics in the Philippines are mostly sourced from regular surveys and censuses, the conduct of which entails high costs. As businesses move onto digital platforms, alternatives to these traditional data sources have become available, one of which is web scraping, the process of collecting information from the web. As digital and online platforms are increasingly used for commerce, web scraping may help reduce the frequency and cost of price surveys, which are subject to time and resource constraints. This paper aims to determine the feasibility of web scraping for collecting price statistics for the calculation of the Consumer Price Index (CPI) in the Philippines. The study includes a pilot run of a four-stage web scraping process, conducted three times over one week (Monday, Wednesday, and Friday) through an automated platform developed in R and designed to scrape prices at regular intervals. The paper also presents a web scraping feasibility assessment for each major division of the Philippine Classification of Individual Consumption According to Purpose (PCOICOP). Finally, it provides recommendations for future web scraping projects in the Philippines.

Keywords: web scraping, online prices, CPI, PCOICOP, R
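
The abstract describes scraping prices from web pages on a Monday, Wednesday, and Friday schedule. The following is only a minimal sketch of that general idea, not the study's platform: the study used R, while this illustration uses Python's standard library, and the abstract does not specify the four stages, so none are reproduced here. The sample markup, the class names `name` and `price`, and the helpers `is_scrape_day` and `scrape_prices` are all hypothetical.

```python
from datetime import date
from html.parser import HTMLParser

# Hypothetical product listing; real retailer pages will be structured differently.
SAMPLE_HTML = """
<ul>
  <li class="product"><span class="name">Rice, 1 kg</span><span class="price">PHP 52.00</span></li>
  <li class="product"><span class="name">Eggs, 12 pcs</span><span class="price">PHP 96.50</span></li>
</ul>
"""

# Monday, Wednesday, Friday in date.weekday() numbering (Monday is 0).
SCRAPE_WEEKDAYS = {0, 2, 4}


def is_scrape_day(day: date) -> bool:
    """Return True if the given date falls on a scheduled scraping day."""
    return day.weekday() in SCRAPE_WEEKDAYS


class PriceParser(HTMLParser):
    """Collect (name, price) pairs from spans with class 'name' or 'price'."""

    def __init__(self):
        super().__init__()
        self._field = None   # which field the next text chunk belongs to
        self._current = {}   # fields gathered for the item being read
        self.items = []      # completed (name, price) tuples

    def handle_starttag(self, tag, attrs):
        cls = dict(attrs).get("class")
        if tag == "span" and cls in ("name", "price"):
            self._field = cls

    def handle_data(self, data):
        if self._field is None:
            return  # text outside the spans of interest
        self._current[self._field] = data.strip()
        self._field = None
        if "name" in self._current and "price" in self._current:
            raw = self._current["price"].replace("PHP", "").replace(",", "")
            self.items.append((self._current["name"], float(raw.strip())))
            self._current = {}


def scrape_prices(html: str):
    """Parse an HTML string and return the list of (name, price) pairs."""
    parser = PriceParser()
    parser.feed(html)
    return parser.items


if __name__ == "__main__":
    if is_scrape_day(date.today()):
        for name, price in scrape_prices(SAMPLE_HTML):
            print(f"{name}: {price:.2f}")
```

In practice the HTML would come from a live page rather than an embedded string, and the parsing rules would need to be adapted to each site, which is part of what a feasibility assessment has to weigh.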
