Please use this identifier to cite or link to this item:
https://rfos.fon.bg.ac.rs/handle/123456789/1944| Title: | Geometric Estimation of Specificity within Embedding Spaces | Authors: | Arabzadeh, Negar Zarrinkalam, Fattane Jovanović, Jelena Bagheri, Ebrahim |
Issue Date: | 2019 | Publisher: | Assoc Computing Machinery, New York | Abstract: | Specificity is the level of detail at which a given term is represented. Existing approaches to estimating term specificity are primarily dependent on corpus-level frequency statistics. In this work, we explore how neural embeddings can be used to define corpus-independent specificity metrics. Particularly, we propose to measure term specificity based on the distribution of terms in the neighborhood of the given term in the embedding space. The intuition is that a term that is surrounded by other terms in the embedding space is more likely to be specific while a term surrounded by less closely related terms is more likely to be generic. On this basis, we lever-age geometric properties between embedded terms to define three groups of metrics: (1) neighborhood-based, (2) graph-based and (3) cluster-based metrics. Moreover, we employ learning-to-rank techniques to estimate term specificity in a supervised approach by employing the three proposed groups of metrics. We curate and publicly share a test collection of term specificity measurements defined based on Wikipedia's category hierarchy. We report on our experiments through metric performance comparison, ablation study and comparison against the state-of-the-art baselines. | URI: | https://rfos.fon.bg.ac.rs/handle/123456789/1944 |
| Appears in Collections: | Radovi istraživača / Researchers’ publications |
Show full item record
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.