SAFETYLIT WEEKLY UPDATE

We compile citations and summaries of about 400 new articles every week.
RSS Feed

HELP: Tutorials | FAQ
CONTACT US: Contact info

Search Results

Journal Article

Citation

Dietz LW, Sertkan M, Myftija S, Thimbiri Palage S, Neidhardt J, Wörndl W. Front. Big Data 2022; 5: e829939.

Copyright

(Copyright © 2022, Frontiers Media)

DOI

10.3389/fdata.2022.829939

PMID

35464121

PMCID

PMC9022027

Abstract

Characterizing items for content-based recommender systems is a challenging task in complex domains such as travel and tourism. In the case of destination recommendation, no feature set can be readily used as a similarity ground truth, which makes it hard to evaluate the quality of destination characterization approaches. Furthermore, the process should scale well for many items, be cost-efficient, and most importantly correct. To evaluate which data sources are most suitable, we investigate 18 characterization methods that fall into three categories: venue data, textual data, and factual data. We make these data models comparable using rank agreement metrics and reveal which data sources capture similar underlying concepts. To support choosing more suitable data models, we capture a desired concept using an expert survey and evaluate our characterization methods toward it. We find that the textual models to characterize cities perform best overall, with data models based on factual and venue data being less competitive. However, we show that data models with explicit features can be optimized by learning weights for their features.


Language: en

Keywords

data mining; content-based filtering; destination characterization; expert evaluation; rank agreement metrics; recommender systems

NEW SEARCH


All SafetyLit records are available for automatic download to Zotero & Mendeley
Print