Hacker News new | past | comments | ask | show | jobs | submit login
Examples and best practices for building recommendation systems (github.com/microsoft)
122 points by navigaid on Jan 18, 2019 | hide | past | favorite | 8 comments



this really should've been named "recommendations for building recommendation systems"


hey one of the authors here. We are planning to create some notebooks on recommendation about recommendation algos :-)


LOL brilliant idea! just don’t hesitate to create an issue and follow that with a PR!


If you're the author, I'm not sure that the definition of recall@k is correct.

>Recall@k is a metric evaluate how many items, in the recommendation list, are relevant (hit) in the ground-truth data.


thanks for the comment, would this definition be clearer: "Recall at k is the proportion of relevant items found in the top-k recommendations"?


My collection of usefull links about recommender systems (dirty) https://gist.github.com/tandav/a2f87e91cb5c441c57657cceb788c...


Sadly but evaluation metrics are only implemented in pyspark.mllib (RDD API) but not in pyspark.ml (Dataframe API)

Also worth mention about implicit feedback


It is also my sadness - AFAIK pyspark.ml does not support ranking metrics (does it?). So we wrapped the RDD and expose data frame as interfaces in our own implementations (the `reco_utils`). Implicit feedback is mentioned in a notebook under `staging` branch (https://github.com/Microsoft/Recommenders/blob/staging/noteb...). Will be there soon in the next release (1st of Feb)!




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: