Hacker News new | past | comments | ask | show | jobs | submit | l1n's comments login


Is it? I'm not familiar enough with those certifications to know if that requires them to divest of their offerings in China.


It’s not. They’ve held FedRAMP certifications for more than a decade.

Source: ex-Akamai InfoSec employee


The requirements of the certification might change for political reasons.


That's not something that's currently planned, and even if it did, it would take years. Besides, it wouldn't apply anyway; Akamai's China business was already routed through a mainland China company, as required by Chinese law.


It generally takes a long time to change the text of a formal requirement. Changing the meaning of the existing text doesn't take so long.


Curious what your robots.txt looked like, if you have a link?


403 is generally a bad way to get crawlers to go away - https://developers.google.com/search/blog/2023/02/dont-404-m... suggests a 500, 503, or 429 HTTP status code.


> 403 is generally a bad way to get crawlers to go away

Hardly... the article links says that a 403 will cause Google to stop crawling and remove content... that's the desired outcome.

I'm not trying to rate limit, I'm telling them to go away.


That article describes the exact behaviour you want from the AI crawlers. If you let them know they’re rate limited they’ll just change IP or user agent.


I made these, proceeds go to the SWF folks https://www.etsy.com/listing/1371574690/shrimp-want-me-unali...


Multiple system prompt segments can be composed depending on needs, so it's useful for this sort of thing to be there to resolve inconsistencies.


https://gerrit.libreoffice.org/c/core/+/172801

Pretty short change for reducing O(n^2) impact with a cache.

This change includes the following scalability improvements for documents containing extremely large paragraphs:

- Reduces the size of layout contexts to account for LF control chars.

- Due to typical access patterns while laying out paragraphs, VCL was making O(n^2) calls to vcl::ScriptRun::next(). VCL now uses an existing global LRU cache for script runs, avoiding much of this overhead.



Thank you. Also https://bugs.documentfoundation.org/show_bug.cgi?id=92064

I lack the context - are they still layong out the widths of characters when wrapping?


Probably shows a bit how little that software is used with Tibetan text if this bug was able to stay open for almost 10 years for what ultimately was a 5 line fix.


The fix looks like a 5 line fix because it is a last step in a very long process of optimizing LibreOffice text layout that started years ago. This 5 line fix could not have been possible 10 years ago simply because the code it is fixing didn't exist back then.


> Probably shows a bit how little that software is used with Tibetan text

... by the LibreOffice devs in Indo-European-speaking countries.


Apparently by anyone if the bug description is accurate. Seemingly one cannot open sufficiently long documents let alone write into them.


Perhaps: from the article:

So long as LibreOffice could not handle long paragraphs there was essentially no free tool to publish Tibetan.


Is the form not loading for you? Works for me, Firefox + Mac


Firefox + Win11, only loads some text and no actual contact methods.


Same for me with Firefox and macOS. The form is getting blocked by Firefox's "Enhanced Tracking Protection" feature. This is the request that's blocked: "GET https://js.hsforms.net/forms/embed/v2.js"


> Sites use robots.txt to tell well-behaved web crawlers what data is up for grabs and what data is off limits. Anthropic ignores it and takes your data anyway. That’s even if you’ve updated your robots.txt with the latest configuration details for Anthropic. [404 Media]

doesn't seem supported by the citation, https://www.404media.co/websites-are-blocking-the-wrong-ai-s...



https://www.jinki.jp/ their website is banger


Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: