To me at least, the major distinction between "classical statistics" and "machine learning" is that machine learning strives to work independently of the underlying distribution, while classical statistics tries to model it.
I.e., a statistician doing linear regression assumes that reality is linear (or at least differentiable) in the region of interest. A convergence proof of linear regression will use this assumption.
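To make that concrete, here's a minimal numpy sketch of that assumption being baked in (the data-generating process and all the numbers are invented for illustration):

    import numpy as np

    rng = np.random.default_rng(0)

    # The statistician's commitment: reality IS linear plus noise,
    # y = 2.0 * x + 1.0 + eps, with eps ~ N(0, sigma^2).
    x = rng.uniform(-5, 5, size=200)
    y = 2.0 * x + 1.0 + rng.normal(0, 0.5, size=200)

    # Ordinary least squares: solve for the coefficients directly.
    X = np.column_stack([np.ones_like(x), x])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    print(beta)  # ~ [1.0, 2.0]; the consistency proof leans on the linearity assumption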
A machine learning practitioner does NOT assume reality actually has a random forest out there in the world somewhere, and as a result needs to prove far more general (and less accurate) convergence results for the random forest.
From what I can tell, this book falls into the former category.
The assumption is that a particular relationship is reasonable to model as if it were linear. No one believes reality is strictly linear.
I've read your posts enough to believe you know how linear regression works. I'm criticizing your comment because it encourages a misunderstanding of traditional statistics as having nonsensical assumptions.
Out of curiosity, was my caveat "(or at least differentiable) in the region of interest" insufficient for that purpose?
I certainly didn't mean to imply that statistics has unreasonable assumptions. Merely that it tends to have stronger assumptions - and more accurate results - than machine learning. Personally I'm a huge fan of classical statistics and think it's currently underappreciated.
The caveat doesn't work, for a technical reason and a more important practical reason. Many relationships, even ones that aren't differentiable everywhere, can be transformed into a linear model. An absolute value function is non-differentiable at one value of the input, but it'd be perfectly fine to model with linear regression. More importantly, the audience I worry about isn't the type to pay attention to parenthetical notes using jargon. "Linear" is somewhat accessible jargon, but "differentiable" is less so. I'm not claiming that I write clearly, but I aim to write such that I don't need caveats.
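For what it's worth, the absolute-value case looks like this in practice: "linear" means linear in the parameters, not in x, so you just put |x| in the design matrix (again a toy numpy sketch, made-up data):

    import numpy as np

    rng = np.random.default_rng(1)
    x = rng.uniform(-5, 5, size=200)
    y = 3.0 * np.abs(x) + rng.normal(0, 0.3, size=200)

    # Regress y on the transformed feature |x|; the model is still
    # linear in its coefficients even though y is not linear in x.
    X = np.column_stack([np.ones_like(x), np.abs(x)])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    print(beta)  # ~ [0.0, 3.0], despite the kink at x = 0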
Yes, it really is underappreciated. As quoted in other comments, "Most businesses think they need advanced ML and really what they need is linear regression and cleaned up data". A significant portion of businesses currently investing millions in ML should basically hire a couple of statisticians and get over it.
To be fair, the fully loaded cost of a couple statisticians (ones who can code, or combined with an engineer assistant) might be half a million or more annually.
>A machine learning practitioner does NOT assume reality actually has a random forest out there in the world somewhere, and as a result needs to prove far more general (and less accurate) convergence results for the random forest.
Of course, most of the time nowadays, "throw a neural network or an SVM at it" doesn't really require strong convergence results... even though there are some nice analytical results for support-vector machines.
I think yummyfajita's point is that a traditional statistics approach begins with some understanding of the system being modeled, and that you build a model using that understanding. There is usually a strong focus on parsimony and explainability, while in ML/AI you don't really care what the underlying model is or how it comes to a particular conclusion; the focus is on accuracy at the expense of explainability.
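A toy contrast of the two mindsets (a sketch using statsmodels and scikit-learn; the dataset and hyperparameters are made up purely for illustration):

    import numpy as np
    import statsmodels.api as sm
    from sklearn.ensemble import RandomForestRegressor

    rng = np.random.default_rng(2)
    X = rng.normal(size=(500, 3))
    y = 1.5 * X[:, 0] - 2.0 * X[:, 1] + rng.normal(0, 0.5, size=500)

    # Statistics-flavored: a parsimonious model whose coefficients
    # you can read off and reason about.
    ols = sm.OLS(y, sm.add_constant(X)).fit()
    print(ols.params)  # interpretable: one number per predictor

    # ML-flavored: hundreds of trees, no single human-readable story,
    # judged almost entirely on predictive accuracy.
    rf = RandomForestRegressor(n_estimators=200, random_state=0).fit(X, y)
    print(rf.score(X, y))  # R^2 on the training data; internals are opaque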
There'd be so much less noise in these comments/discussions if we just did away with vague and ill-defined labels such as ML or AI.