Hacker News new | past | comments | ask | show | jobs | submit | anton_tarasenko's comments login

Maker here. I've put together several custom search engines that helped me fight poor Google search performance on certain queries. The search engines use Google Custom Search Engine configured for selected websites and return succinct results on general queries, especially the queries attracting SEO spammers.

The engines are listed on https://searchcommons.org/engines.html and here are some of them:

— The web minus Alexa top 1000 most visited websites https://searchcommons.org/?e=most-visited-websites-excluded

— Official docs for Python and its libraries https://searchcommons.org/?e=python-docs (feel free to request an engine for your programming language)

— Universities with Nobel Prize, Fields Medal, and Turing Award laureates https://searchcommons.org/?e=universities-with-laureates

The lists of selected websites are public and open to contributions: https://github.com/antontarasenko/searchcommons


I have a similar private thing. The thing is ended up with duck duck duck go.


The AIP also maintain free access to their oral histories: https://www.aip.org/history-programs/niels-bohr-library/oral...

They interviewed many famous physicists. Mostly autobiographical, but unlike in books, the interviewees don't have time to edit the answer, so the reading experience is generally better.


Subreddit /r/askscience does a good job at explaining science in plain words. I usually google "site:reddit.com/r/askscience/ __QUESTION__".

The StackExchange sites have less coverage and answers tend to be more technical.

University websites return reliable answers, but often neither short nor accessible.


Thank you for going after primary sources.

The sample letter refers to the press. Executives tend to be general with the press. Investors make them talk specifics. That happens during quarterly earning calls. Which would be a good source. Seeking Alpha has the transcripts.

The SEC filings are another liable source. Though not personal quotes, they include perspectives on tech trends.


Yep - 2 things about this

1) I am using the press as a kind of filter of what's interesting (hypothesis: they pick the most interesting thing executives are saying on earning calls)

2) I am working on extracting other sources like earning transcripts and twitter, but this will be tougher for me to go through due to the sheer amount of things that are said in those sources

SEC is a good one too, the management analysis section of filings is always interesting so I could extract that.

Thanks for the feedback!


The Wikipedia page on Moore's law includes similar observations.[1] For example, the DNA sequencing cost.[2]

And another list that features laws, some of which describe exponential processes.[3]

[1] https://en.wikipedia.org/wiki/Moore%27s_law#Other_formulatio...

[2] https://en.wikipedia.org/wiki/Carlson_curve

[3] https://en.wikipedia.org/wiki/List_of_laws#Technology


In terms of depth and breadth, the Princeton companions get close to Feynman.[1][2]

A more formal approach appears in handbooks.[3][4]

[1] Gowers et al., The Princeton Companion to Mathematics. https://press.princeton.edu/books/hardcover/9780691118802/th...

[2] Higham and Dennis, The Princeton Companion to Applied Mathematics. https://press.princeton.edu/books/hardcover/9780691150390/th...

[3] Zwillinger, CRC Standard Mathematical Tables and Formulae. https://www.crcpress.com/CRC-Standard-Mathematical-Tables-an...

[4] Bronshtein, Handbook of Mathematics. https://www.springer.com/gp/book/9783540721222


But is really breadth and depth what makes Feynman’s approach to physics unique?

To me the unique aspect is more the uncompromising intuitionistic approach with little consideration/adaptation for “shallow/correlative thinkers”...


It's easy to gain physical intuition because you can often explain one physical phenomenon in terms of another physical phenomenon that you have much more real life experience with.

But with mathematics, "intuitive" analogies are all in terms of other mathematical objects! You can't build intuition if you don't even know what they trying to abstract over.

In that regards, The Princeton Companion to Mathematics is fantastic because it maps out how the different fields of mathematics are interrelated.


This. I think everyone in this topic is missing the point of what makes the Feynman Lectures unique.


I'd use 3 words to describe Feynman's style: clarity, accessibility, and fascination.


This description made me think of the "Mathologer" channel on YouTube.


Good list! I'd also add Mary Boas' Mathematical Methods in the Physical Sciences. A standard textbook for incoming students across disciplines and very accessible


We used that book for a course and I found it among my less favourite ones. its been a few years since I used it, but I remember it shallow and uninspiring. not trying to start an argument here, maybe just an outlier opinion since this seems a standard textbook.


It was one of my course books as well, but I think it's aimed at the American market and style of learning/presentation. I much preferred Stroud's "Engineering Mathematics" which was a course book for engineers at my university (I studied physics).


>>> aimed at the American style of learning

Ouch! Boas is maybe not as inspiring as Feynman. But when you see a copy on someone's bookshelf. It tends to be just as dog-eared and spine-cracked as Surely You're Joking

Another resource I just thought of. While not a textbook per se. Math competition problems from previous years can be very stimulating ;)

https://www.maths.cam.ac.uk/undergrad/pastpapers/past-ia-ib-...

https://kskedlaya.org/putnam-archive/


I purchased [2], having enjoyed Nick Higham's other book (a treatise on matrix computations), and knowing how well-received [1] was.

But, [2] turned out to be kind of a dud. It was not really fun to browse, and I wasn't sure who it was directed to. The articles that I sampled read like they were intended for academic applied math folks, rather than introductions for interested outsiders. It's a huge book, so YMMV, and has been very well-reviewed by high-profile and well-qualified academics (like Steven Strogatz) but I spent a couple evenings with the book and could not recommend.

In any event, it's not like Feynmann's lectures! It's an encyclopedia.

TLDR: "it was good for someone, but it was not the book I wanted".

(PS: recommending the CRC tables is an odd thing, this is also nothing like Feymann's lectures)


I'm collecting HN meetup groups on GitHub: https://github.com/antontarasenko/hacker-news-groups (BTW, thanks to all contributors from HN)

@lainon HN meetups in Germany:

* Berlin http://www.meetup.com/Berlin-Hacker-News-Meetup/

* Munich https://www.meetup.com/Hacker-News-Munich/


How active is the FB group for San Francisco? I refuse to have a FB account, so I can't check.

If the group owner is reading, can we have a non-FB meetup group please?


"Algorithmic justice" reminded me of a study where researchers predicted the risk of a crime better than judges:[1]

> Millions of times each year, judges must decide where defendants will await trial—at home or in jail. By law, this decision hinges on the judge’s prediction of what the defendant would do if released. This is a promising machine learning application because it is a concrete prediction task for which there is a large volume of data available. Yet comparing the algorithm to the judge proves complicated. First, the data are themselves generated by prior judge decisions. We only observe crime outcomes for released defendants, not for those judges detained. This makes it hard to evaluate counterfactual decision rules based on algorithmic predictions. Second, judges may have a broader set of preferences than the single variable that the algorithm focuses on; for instance, judges may care about racial inequities or about specific crimes (such as violent crimes) rather than just overall crime risk. We deal with these problems using different econometric strategies, such as quasi-random assignment of cases to judges. Even accounting for these concerns, our results suggest potentially large welfare gains: a policy simulation shows crime can be reduced by up to 24.8% with no change in jailing rates, or jail populations can be reduced by 42.0% with no increase in crime rates.

[1] https://www.cs.cornell.edu/home/kleinber/w23180.pdf


The authors note that judges may care explicitly about racial bias, but based on a quick read they're making a really, really big mistake in the language they're using: they confuse arrests with crime. Arrests and convictions are simply a measurement mechanism for crime, which is known to have severe biases.


Kleinberg uses arrests for violent crime because they are known to have substantially less bias/zero bias


There is no convincing evidence that arrest rates "severely" overestimate offense rates; if anything it is just as likely arrest rates underestimate offense rates.


While what you say specifically is true, using arrest rates to determine criminal activity by race (and the subsequent conviction rate, etc.) has been shown to have strong relationship to race; at least in the United States. There are entire books on the subject. You can't tie racial arrest rates to the underlying crime rate, as POC get arrested far more often for the same crimes.


Indeed. In a closed society where everyone is guilty the only crime is getting caught.

This concept is the core basis of the war on drugs.


I'm not sure I understand that correctly. Are you saying that POC are acquitted or have charges dropped far more often for the same crimes (i.e. have a far lower conviction rate)?


POC are more likely to be pulled over, then when pulled over more likely to ask to be searched, then when searched more likely to be arrested when situations are similar to non-POC folks.

It doesn't really stop there either, they are more likely to be convicted of the same crimes and then get longer sentences. They are less likely to be offered probation. This eliminates huge percentages of men permanently from POC communities. It is possible that this process can be blamed for the social issues present in the inner city.

"The New Jim Crow" covers a lot more in a lot more detail. I would strongly recommend the read.


In general, POC are more likely to be arrested for committing a crime.

The parent's point isn't about whether they are acquitted, it's that if you were to commit a crime as a POC, you are more likely to be arrested than if you had committed that same crime as a non-POC. In both scenarios you committed a crime, but in one of them the system never has a record of it. This is why arrest rates and crime rates are different: if a POC is more likely to get arrested for committing a crime, the arrest rates by race (POCs get arrested more) will not reflect the crime rates by race (differences are generally smaller).


Most empirical data indicates that white people are arrested and convicted at a higher rate relative to basal offense rate than black people.


Post your sources. I've seen otherwise (especially for low level drug offenses) but you're the one making the claim.


Communities of color are over-policed. There's a huge racial divide in income, and crimes that tend to be committed by the poor (like shoplifting, loitering, and fare-evasion) are far more likely to be prosecuted than crimes committed by the wealthy (smoking some weed in your suburban living room, fudging your taxes a bit). The end result is that people of color are more likely to have an arrest record, even if they're just as likely to commit a crime as a white person.


For an example of this backed up by data, the NYPD stop-and-frisk program has always overwhelmingly focused on black and Latino people (almost 90% of all people stopped in some years), even though they make up only 15% of the population of some of the precincts involved AND white people were more individually likely to actually have an illegal weapon (the supposed reason for the stop-and-frisk program to exist).

https://www.nyclu.org/en/stop-and-frisk-data https://www.nyclu.org/en/press-releases/analysis-finds-racia...


Just to add a bit, you can look at the justice system as a binary classifier if you squint hard enough. So it has both false positives and false negatives, both of which are difficult to actually measure.

On the one hand, you have poc arrested and convicted of crimes that wouldn't be charged in other parts of town (arguably false negatives, amongst the non poc). On the other, prosecutors use the plea bargain system to get people to plea out for smaller charges instead of risking decades of their lives at trial; this is an excellent way to produce false positives.


Until the populace learns how to improve their chances of getting released and starts to game the system, introducing endogeneity.

(Also noticed the nice coincidence of a professor with user name klienber having a NBER Working Paper)


Which is totally not possible with judges, right ?

I feel like a lot of arguments being made here fail the A vs B test. Any argument that purports to provide help with choosing Judges vs Algorithms needs to apply differently to Judges, and differently to Algorithms.

How about: with Judges we simply won't know (for sure) what influences them. Are they racist ? Who knows. Do they prefere to let people with jobs out (realistically: yes, but we don't know for sure). Do they ...

With algorithm we can literally test, by presenting them with artificial cases, lots of them, and see how they judge. With a judge, you can't.


> I feel like a lot of arguments being made here fail the A vs B test. Any argument that purports to provide help with choosing Judges vs Algorithms needs to apply differently to Judges, and differently to Algorithms.

Out of curiosity, is there a name for this "fallacy", if it is one, since to me it mostly seems like the other party is failing at some basic level of critical thought.

I've been dealing a lot with arguments of this nature at work, and it'd be great to have a name to it. Pointing it out in the verbatim sense ("ok, but that's true of <your counter position> as well") becomes tiring quickly, and honestly, just causes the person to move on to the next fallacious claim.


Well, humans are already capable of dealing with it. The judges know that prisoners know what is expected of a good prisoner. The decisions are already being made with that in mind.

Contrast this with evaluating a programmer's performance. Everyone knows that lines of code written, number of tickets closed, number of fixed bugs or lines of documentation written do correlate well with performance. But the minute they are revealed to impact performance reviews, those metrics becone trash. Until you can find viable instruments, you shouldn't ever put those into a model and expect to have good predictions. If your model is not explicitly equipped to deal with endogeneity (like structural equation models), it will fail when faced with it.

If you think a judge is influenced by things that are unrelated to the case, you should appeal to the court above (which you can readily do in Continental Europe, but I don't know about Common Law).


> Well, humans are already capable of dealing with it

If that were true, algorithms wouldn't be able to outperform those humans on the metrics that matter.

About your faking metrics issue, the trick in this case is simply taking the metrics that matter and feeding them into an algorithm. Problem solved.

Take metrics:

1) will suspect face justice if released

2) will he reintegrate faster if released

Anyone criminal who wants to game those metrics, well I for one will be applauding that !


Until the populace learns how to improve their chances of getting released

As long as that correlates with behavior we want to see from the populace; https://xkcd.com/810/


>By law, this decision hinges on the judge’s prediction of what the defendant would do if released.

That isn’t the legal standard for release pending trial.

Thus is may look like an algorithm “predicts” crime better than judges, but judges can’t withhold bail/bond because they “predict” a certain defendant will commit another crime (because that’s not exactly judges are doing when determining bond).


Milton Friedman on doing this in the 1940s:[1]

> One of my problems was to provide statistical advice to the people who were developing metals to be used in the blades of turbines. I had an enormous amount of data, and I had to construct a regression with five or six different variables having to do with the chemical composition of the metals.

> We estimated that it would take us three months to solve this problem using our desk calculators. In the whole country there was only one calculator—one computer, if you want to call it that—which could do this problem more quickly.

> It was up at Harvard. It wasn’t electronic. It was a whole collection of IBM card sorters. It was in a big, air-conditioned gymnasium, a tremendous collection of sorters all linked by wires. It did our problem for us in forty hours.

As he mentioned elsewhere, it did not work as expected back then.

[1] https://miltonfriedman.hoover.org/friedman_images/Collection...


Freeman Dyson reviews and extends "What is Life?" in his "Origins of Life," both a lecture[1] and a book.

[1] http://inspirehep.net/record/1268726/files/978-4-431-77056-5...


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: