@apple → various startups → @featurepeek (YC S19) → @netlify
now working on https://pure.md
- andrethegiantUse cloudflare to redirect requests that have text/plain in the accept header to use the corresponding api endpoint
- I have no plans in downloading Atlas either, but I think your browsing isn't used for training unless you opt in.
> By default, we don’t use the content you browse to train our models. If you choose to opt-in this content, you can enable “include web browsing” in your data controls settings. Note, even if you opt into training, webpages that opt out of GPTBot, will not be trained on.
- What’s wrong with it? It’s a great word
- 1 point
- I want one just to jailbreak
- I am so sick of these anthropomorphized names that have nothing to do with anything that we’re all supposed to remember now. Why are we giving products first names? The worst offender is probably Amazon Rufus. It’s all so dumb and I hate it. At least attempt to be clever and name it something that relates to the product itself. Even Google Wave, despite its shortcomings, made sense as a product name.
- > I think there should also be a reasonable cap on the number of connections that can be made. Something like 300 friends sounds right. Any more than that and you're a collector, and not using the platform to foster connection.
Path[1] did that, but with a cap of 50, and then 150 (based on the Dunbar number of meaningful human connections one can retain). They had a crazy growth period but eventually went kaput.
- The mitochondria is the powerhouse of the cell
- Doesn’t this conflict with the original proposal of appending .md to any resource, e.g. /foo/bar.html.md? Or why not tell servers to respond to the Accept header when it’s set to text/markdown?
- <link rel="alternate" /> is a standards-friendly way to semantically represent the same content in a different format
- > If anyone has a browser plugin which automatically redirects to llms.txt sign me up.
Not a browser plugin, but you can prefix URLs with `pure.md/` to get the pure markdown of that page. It's not quite a 1:1 to llms.txt as it doesn't explain the entire domain, but works well for one-off pages. [disclaimer: I'm the maintainer]
- Too busy rolling in money to actually write a substantial announcement?
- Free tier (i.e. using API keys but without a paid subscription) is rate-limited to 10 requests per minute. https://pure.md/docs/#section/Rate-limits
- > it seems to crash on any Arxiv PDF
Fixed, thanks for reporting :-)
- Could be. But their pricing for the conversion is free, which leads me to believe LLMs are not involved.
- Apart from lacking newlines, how is the result bad? It extracts the text for easy piping into an LLM.
- Does it work on YouTube videos that have transcripts disabled?
- Cloudflare’s ai.toMarkdown() function available in Workers AI can handle PDFs pretty easily. Judging from speed alone, it seems they’re parsing the actual content rather than shoving into OCR/LLM.
Shameless plug: I use this under the hood when you prefix any PDF URL with https://pure.md/ to convert to raw text.