One common approach is to look for the elbow in the curve of <metric> vs. K (number of clusters). This is essentially finding the number of clusters after which the rate of information gained/variance explained/<metric> slows. I believe it's possible to binary search for this point if you can assume the curve is convex.
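A minimal sketch of that idea, assuming the metric curve is convex and decreasing: one common way to define the elbow is the point farthest below the chord joining the curve's endpoints, and for a convex curve that gap is concave, so a binary/ternary search finds it in O(log n) evaluations. (The `find_elbow` helper and the example curve here are illustrative, not from any particular library.)

```python
def find_elbow(metric):
    """Return the index of the elbow of a convex, decreasing curve:
    the point farthest below the chord joining the endpoints.
    Since the curve is convex, gap(i) = chord(i) - metric(i) is
    concave, so ternary search converges to its maximum."""
    n = len(metric)

    def gap(i):
        # chord value at index i (linear interpolation between endpoints)
        chord = metric[0] + (metric[n - 1] - metric[0]) * i / (n - 1)
        return chord - metric[i]

    lo, hi = 0, n - 1
    while hi - lo > 2:
        m1 = lo + (hi - lo) // 3
        m2 = hi - (hi - lo) // 3
        if gap(m1) < gap(m2):
            lo = m1 + 1   # maximum lies to the right of m1
        else:
            hi = m2 - 1   # maximum lies to the left of m2
    return max(range(lo, hi + 1), key=gap)

# Example: within-cluster variance for K = 1..8, elbow at K = 3
curve = [100, 40, 15, 12, 10, 9, 8.5, 8]
print(find_elbow(curve) + 1)  # prints 3 (1-indexed K)
```

In practice the same trick is often applied without the search (just scan all K, since K is small); the search only pays off when each evaluation means re-running the clustering.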
I think it is more accurate to say that data science isn't Kaggle. The process of taking a data set and fitting the most robust model to it is certainly machine learning.
Yeah, trg2 would need to put all the edge case rules towards the beginning. That, or just put the basic rules at the beginning and have separate conditionals at the end to handle the edge cases.