To Have a Tiger by the Tail: Improving Music Recommendation for International Users Philippe Hamel Google, Mountain View, CA, USA

1. Introduction At the heart of any music recommendation system is the notion of music similarity. In order to produce relevant recommendations, a system must have some internal representation of how musical entities are related together. Such systems usually learn this representation through different types of data: audio features, user listening data, user ratings, metadata, expert annotations, etc. One of the most useful signals for building a music similarity representation for recommendation comes from learning a collaborative filtering (CF) space from user data such as user listening behavior and user ratings. However, the distribution of this data is heavily biased towards the most popular music, what we call the head of the distribution. We have much less data on less popular artists from the tail of the distribution. In consequence, the CF space represents relatively accurately head artists, and becomes noisy for tail artists. This problem becomes even more important when we consider an international user base. Since online services are not uniformly popular or available across the world, the user data distribution is heavier in some countries. Thus, the geopolitical distribution of the users will have a great effect on the CF representation. If this fact is not taken into account, CF models trained over this data will tend to spend most of their capacity modeling popular music from a few countries, while regional or culturally specific music will be somewhat ignored by the model. In practice, when using a naive CF algorithm, locally popular artist representations will tend to cluster tightly in culture-specific clusters. Recommendation models based on these representations will tend to see all artists from small cultural groups as similar, while listeners from this group will consider them as dissimilar.

c Philippe Hamel, Licensed under a Creative Commons Attribu tion 4.0 International License (CC BY 4.0). Attribution: Philippe Hamel, “To Have a Tiger by the Tail: Improving Music Recommendation for International Users” Machine Learning for Music Discovery Workshop at the 32nd International Conference on Machine Learning, Lille, France, 2015.

HAMELPHI @ GOOGLE . COM

In this work, we focus on trying to improve the representation of music similarity for tail music content, in particular for content that is specific to a culture or a geographical region, while preserving the quality of the representation for head content. We propose to improve a basic CF model by mixing in external signals and combining local CF models.

2. Baseline 2.1. Music affinity score In this work, we focus on the task of finding similar items (track, artist, album, etc.), given a query item in the context of recommendation. The concept of music similarity is ill-defined and depends on what task we are trying to solve. Here, we define similarity in the following way. An item B is similar to item A if B is a good recommendation for users who like item A. What constitutes a good recommendation is also vague and context dependent, but can be empirically measured with offline ground truths or online user feedback. We denote this similarity relationship as an affinity score aff (A → B). A higher affinity score, means that B is a better recommendation for A. Note that this relationship is not necessarily symmetric, i.e. aff (A → B) 6= aff (B → A) in general. To obtain a list of recommendations for an item, we compute the affinity score between the query and all items, and take the top results from the ranked list. 2.2. Base model Given user listening data (play counts, ratings, etc.) it is possible to build a sparse user-item relationship matrix, where items could be tracks, artists or other musically related entities. From this matrix, we can learn a lower dimensional representation of user and items using an SVDlike algorithm. This is what we refer to as the CF space. From the learned representation of the items, we can define a metric to compute an affinity score between two items. For instance, one could define affinity as cosine similarity in the item space. The details of how we obtain this CF affinity is beyond the scope of this work. However, for the sake of argument, let’s assume that we can compute a CFbased affinity affCF (A → B) learned from our user data. This is the base model we aim to improve on.

Improving Music Recommendation for International Users

3. Proposed improvements Starting with the base CF model, how can we improve recommendations for tail content? We first consider mixing in external signals to complement the CF representation. Then, we try to improve the CF representation by combining several local models. 3.1. Mixing affinity signals One obvious way to improve on the CF based affinity is to mix in signals that do not exhibit the same cultural or popularity bias as the user data with the base CF signal. We can obtain a global affinity score by computing a weighted sum of the signal affinities:X aff (A → B) = Wi affi (A → B) i∈signals

where signals include the base CF signal as well as other signals discussed below, and the weights Wi are optimized through empirical evaluation. The weights can also be a function of the seed query. This could allow, for instance, to put more weight on non-CF signals for queries for which the CF signal is noisy. 3.1.1. C ONTENT- BASED FEATURES One of the most obvious signals to include in a music recommender system is the actual audio signal. Audio features are blind to music popularity and social relationships. They can give a good representation of genre, mood, instrumentation, tempo, etc. This makes them a good complement to the CF signal. One challenge with audio features is to obtain a smooth feature space from which to obtain a relevant affinity score between items. One way to solve this problem is to train a supervised learning model on top of the audio features. In this case, the targets used to train the model represent proxies for audio similarity. Genres, tags or even the CF representation itself can be good targets for training the audio model.

3.1.3. W EB - CRAWLING It is possible to obtain artist relationship data from other sources on the web. For instance, co-occurence analysis on Wikipedia or music blogs can provide a good complementary signal for artists for which listening data is sparse or unavailable. Web search data can also be a very useful signal, although it might not be publicly available. 3.2. Training local models The CF representation can be improved by making a better use of the sparse tail data. One way to do this is to explicitly divide the users into several sub-groups and train CF models on these groups. By dividing the data into smaller targeted datasets, it is possible to better understand the relationship between items within these groups. For instance, to account for the non-uniform geographic distribution, we can train a model for each country. Or, to account for cultural groups within countries, we can divide the data by country and language. To obtain a global CF affinity function, we can combine the affinity from all the local models: P Wi (A, B)affi (A → B) i∈datasets P affglobalCF (A → B) = Wi (A, B) i∈datasets

where Wi (A, B) are weights that grow with the popularity of items A and B within the dataset i. This has the effect that groups that know more about items A and B will influence more the global affinity for these items. Dividing the data into explicit user groups requires user demographic information. However, it is still possible to use this technique even without explicit demographic information. If we assume that some user characteristics influence the listening behavior, it should be possible to find these groups from the data. One way to infer user groups is to use an unsupervised clustering algorithm on the user data.

3.1.2. H UMAN - CURATED INFORMATION One other way to improve recommendations is to use expert annotated data. Annotated data can include information such as genre, instrumentation, mood, melodic content description, etc. Curated data has the advantage of being very reliable. A simple affinity function based on curated data could for instance give a high score when two items exhibit similar relevant characteristics.

4. Conclusion

The main problem with expert annotated data is that is costly to obtain and hard to scale internationally. It is virtually impossible to have experts cover all of the tail content. Crowdsourcing is a good alternative to expert annotation. It is relatively cheap and easily scalable, at the cost of added noise.

Acknowledgments

The task of finding relevant recommendations for tail content is challenging. In this work, we discuss why this is hard, and propose improvements on a base CF model. The solutions proposed include using content-based models, leveraging human-curated data, mixing in external sources, training local models and modeling user characteristics.

This work was done in collaboration with several researchers and engineers. I would like to thank Li Zhang, Rasmus Larsen, Sally Goldamn, Nicolas BoulangerLewandowski, Adam Roberts, Pierre Grinspan, Douglas Eck, and Daniel Steinberg for their contributions.

To Have a Tiger by the Tail: Improving Music ... - Semantic Scholar

distribution of this data is heavily biased towards the most popular music ... Since online services are ... tion of music similarity for tail music content, in particular.

119KB Sizes 0 Downloads 202 Views

Recommend Documents

To Have a Tiger by the Tail: Improving Music ... - Semantic Scholar
on Wikipedia or music blogs can provide a good comple- mentary signal for artists for which listening data is sparse or unavailable. Web search data can also be ...

A Pandemonium Can Have Goals - Semantic Scholar
the code for matching a part (e.g. the subject, the sender and the address of an email). Differently ... A Band is the resultant of auto-organization in a bottom-up.

A Pandemonium Can Have Goals - Semantic Scholar
For each agent, more energy means more resources (e.g. more computational .... between multiple alternative goals to pursue, and to decide to adopt goals from ...

Transfer Understanding from Head Queries to Tail ... - Semantic Scholar
Nov 7, 2014 - Microsoft [email protected]. Shusen Wang. Zhejiang University [email protected]. ABSTRACT. One of the biggest challenges of commercial search ... data (query logs). Tail queries have little historical data to rely on, which makes them d

A Machine Learning Approach to Automatic Music ... - Semantic Scholar
by an analogous-to-digital converter into a sequence of numeric values in a ...... Proceedings of the 18th. Brazilian Symposium on Artificial Intelligence,.

Improving Health: Changing Behaviour. NHS ... - Semantic Scholar
Difficult situations and if-then plans (client worksheet). 53-54. Dealing ... inequalities - accredited Health Trainers. Health Trainers ... science can help people in changing habits and behaviour. ... 1 Carver, C.S. & Scheier, M.F. (1998). On the .

Improving Embeddings by Flexible Exploitation of ... - Semantic Scholar
Many machine learning approaches use distances between data points as a ..... mapping it into new spaces and then applying standard tech- niques, often ...

Improving English Pronunciation: An Automated ... - Semantic Scholar
have set up call centers in India, where telephone operators take orders for American goods from. American customers, who are unaware that the conversation ...

Improving Health: Changing Behaviour. NHS ... - Semantic Scholar
Identify facilitators (including social support) and encourage the client to make ..... computer) are kept securely and are accessible only to you, your client and ...... chat room (these are message boards on the internet) where you could go for.

Improving Access to Web Content at Google - Semantic Scholar
Mar 12, 2008 - reading application, this site is completely accessible for people with visual loss. It can be a terrific commuting and trip planning tool”. - Rob ...

Land race as a source for improving ... - Semantic Scholar
KM-1 x Goa local and C-152 x Goa local F1 hybrids yielded better than the best parent, a land race itself. This improved ..... V.P., 2000, Genotypic difference in.

Land race as a source for improving ... - Semantic Scholar
KM-1 x Goa local and C-152 x Goa local F1 hybrids yielded better than the best parent, a land race itself. This improved ..... V.P., 2000, Genotypic difference in.

The challenges of six sigma in improving service ... - Semantic Scholar
The current issue and full text archive of this journal is available at ... Recently, a number of articles has focused on the importance of six sigma for ...... Rylander, D.H. and Provost, T. (2006), “Improving the odds: combining six sigma and onl

A Appendix - Semantic Scholar
buyer during the learning and exploit phase of the LEAP algorithm, respectively. We have. S2. T. X t=T↵+1 γt1 = γT↵. T T↵. 1. X t=0 γt = γT↵. 1 γ. (1. γT T↵ ) . (7). Indeed, this an upper bound on the total surplus any buyer can hope

Semantic Queries by Example - Semantic Scholar
Mar 18, 2013 - a novel method to support semantic queries in relational databases with ease. Instead of casting ontology into rela- tional form and creating new language constructs to express ...... uni-karlsruhe.de/index_ob.html. [19] OTK ...

A Appendix - Semantic Scholar
The kernelized LEAP algorithm is given below. Algorithm 2 Kernelized LEAP algorithm. • Let K(·, ·) be a PDS function s.t. 8x : |K(x, x)| 1, 0 ↵ 1, T↵ = d↵Te,.

Semantic Queries by Example - Semantic Scholar
Mar 18, 2013 - Finally, we apply the query semantics on the data to ..... mantic queries involving the ontology data are usually hard ...... file from and to disk.

Iterative mesh partitioning strategy for improving ... - Semantic Scholar
Accordingly, the overall efficiency of parallel analysis is not optimized. A better ... This work embeds, in MPE++ a graph partitioning package, called. JOSTLE ...

When one Sample is not Enough: Improving Text ... - Semantic Scholar
Computer Science Department. Columbia .... contains articles about Computer Science. .... ment classification [22], each category level might be assigned.

Iterative mesh partitioning strategy for improving ... - Semantic Scholar
Computer-Aided Engineering, Department of Civil Engineering, National Taiwan ... substructure interior degrees-of-freedom are performed independently and .... elements to achieve a good balance of workloads among substructures.

Improving DNN speaker independence with $i - Semantic Scholar
in recent years, surpassing the performance of the previous domi- nant paradigm ... part-model, for each speaker which, in a cloud-based speech recog- nizer adds ... previous layer, with the first layer computing a weighted sum of ex-.

Steps for Improving Data Comprehension for ... - Semantic Scholar
data is not collected internally, limiting detection ability of insider threats. Alternatively ... tempts to classify current data without the use of training data. Anomalous ..... as proposed in [15] are insufficient; such Internet scale tech- nique

When one Sample is not Enough: Improving Text ... - Semantic Scholar
For illustration purposes, Table 2 reports the computed ...... Zipf, power-laws, and Pareto – A ranking tutorial. Available at http://ginger.hpl.hp.com/shl/-.