DIKUL - logo
E-viri
Celotno besedilo
Recenzirano
  • Data Verification Methodolo...
    Kramer, Eric; Noh, Hyunsoo; Li, Xiao

    Transportation research record, 05/2022, Letnik: 2676, Številka: 5
    Journal Article

    Researchers on travel behavior and regional economic trends increasingly rely on multiple data sources to locate employers and site-specific employment. In a previous study, we proposed a method to assess and integrate multiple sources of employment data using three components: the Google Places application programming interface (API), a business existence verification model, and manual reviews of sampled data. This paper updates our previous methodology with a dual conditional classification of incoming and previously verified employment data made possible by checks using Google Places API and two rounds of string comparisons for both business names and establishment locations. The resulting match classes distinguish well-matched or confirmed business listings from those that require additional review to evaluate potential business closure or relocation. This screening process, augmented with fuzzy logic string matching techniques, reduces the effort needed to update employer information and assists with automated data standardization and deduplication, integrating incoming employment information with a database of verified employers.