- NordVPN calls out when a location is virtual, so unless ipinfo is claiming they have virtual locations that are not labelled as such, they are at least transparent about it. They did document the physical server locations of their virtual locations at launch, but I'm not sure if there's a live doc for new locations. https://nordvpn.com/blog/new-nordvpn-virtual-servers/
- You sort of can on Android, but it's a few steps:
1. Trigger Circle to Search by long-pressing the home button/navigation bar
2. Select the image
3. Navigate to "About this image" in the Google search top bar, all the way to the right, and check if it says "Made by Google AI", which means it detected the SynthID watermark.
- As if the App Store had any sort of those guarantees. I know of people who have been scammed via WebView wrappers that purported to be some benign app to pass app store review, and were then pointed at fake exchange websites afterwards. GitLab, which was hosting their C&C mechanism, took action to take down multiple scam apps across multiple developer identities faster than Apple or Google did, but the scammers spun up new apps the next day.
- Riot documents the need to have IOMMU support enabled for Vanguard: https://support-valorant.riotgames.com/hc/en-us/articles/222...
- Vertex's offering of Gemini very much does implicit caching, and that has always been the case [1]. The recent addition of implicit cache hit discounts also works on Vertex, as long as you don't use the `global` endpoint and instead hit one of the regional endpoints (sketch after the reference below).
[1]: http://web.archive.org/web/20240517173258/https://cloud.goog..., "By default Google caches a customer's inputs and outputs for Gemini models to accelerate responses to subsequent prompts from the customer. Cached contents are stored for up to 24 hours."
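A minimal sketch of pinning a regional Vertex endpoint with the google-genai SDK; the project ID and model name here are placeholders, not a specific recommendation:
```python
# Sketch: pin a regional Vertex AI endpoint (not "global") so implicit
# cache hit discounts can apply. Project ID and model are placeholders.
from google import genai

client = genai.Client(
    vertexai=True,
    project="my-gcp-project",  # placeholder
    location="us-central1",    # a regional endpoint, not "global"
)
response = client.models.generate_content(
    model="gemini-2.0-flash",
    contents="Hello",
)
print(response.text)
```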
- Gemini uses SentencePiece [1], and the proprietary Gemini models share the same tokenizer vocabulary as Gemma [2, 3, 4], so you can tokenize locally with Gemma's tokenizer (sketch after the references).
Of the large proprietary Western AI labs (OpenAI, Anthropic, Google), only Anthropic, with Claude 3 and newer, lacks local tokenizers.
[1] https://github.com/google/sentencepiece
[2] https://github.com/googleapis/python-aiplatform/blob/main/ve...
[3] https://storage.googleapis.com/deepmind-media/gemma/gemma-2-...: "We inherit from the large Gemini vocabulary (256k entries)."
[4] https://storage.googleapis.com/deepmind-media/gemma/Gemma3Re...: "We use the same tokenizer as Gemini 2.0."
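A sketch of local token counting, assuming you've downloaded Gemma's `tokenizer.model` from one of the Gemma releases (the file path is a placeholder):
```python
# Sketch: count tokens locally with SentencePiece using Gemma's tokenizer,
# which shares its vocabulary with Gemini. The model path is a placeholder.
import sentencepiece as spm

sp = spm.SentencePieceProcessor(model_file="tokenizer.model")
ids = sp.encode("Hello, world!")
print(len(ids), ids)  # token count, then the token IDs
```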
- It's a change to the CA rules, passed in https://cabforum.org/2022/04/06/ballot-csc-13-update-to-subs..., to align OV certificate requirements with the EV ones (which enforce the use of HSMs/hardware tokens/etc.). It was meant to go into effect for new certificates issued after November 2022, but was delayed and eventually implemented on June 1, 2023.
- Since April 2023 they have supported custom OIDC providers[1], and as of April 2024 that was extended to the free plan as well[2], so you can bring your own auth.
- https://www.minimaxi.com is the website for their Chinese parent company 上海稀宇科技有限公司, and https://minimax.io is their international website for the Singapore-based company Nanonoble Pte Ltd, which handles operations outside of China.
- I did read it, and I even went to their eval repo.
> At the time of writing, there are two major versions available for GPT-4 and GPT-3.5 through OpenAI’s API, one snapshotted in March 2023 and another in June 2023.
openaichat/gpt-3.5-turbo-0301 vs openaichat/gpt-3.5-turbo-0613, and openaichat/gpt-4-0314 vs openaichat/gpt-4-0613. Those are two _distinct_ versioned snapshots of each model, not the _same_ model observed over time, which is what people mean when they complain that a model gets "nerfed".
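For illustration, that comparison amounts to querying two pinned snapshots side by side, roughly like this with the OpenAI Python SDK (both snapshots have since been retired, so treat this as a historical sketch and substitute current model IDs):
```python
# Sketch: compare two pinned, versioned snapshots rather than a moving
# alias. These snapshot names are retired; swap in current ones.
from openai import OpenAI

client = OpenAI()
for model in ("gpt-4-0314", "gpt-4-0613"):
    resp = client.chat.completions.create(
        model=model,
        messages=[{"role": "user", "content": "Is 17077 prime?"}],
    )
    print(model, resp.choices[0].message.content)
```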
- Your linked article is specifically comparing two different versioned snapshots of a model and not comparing the same model across time.
You've also made the mistake of conflating what's served via API platforms, which are meant to be stable, with frontends, which have no stability guarantees and are very much iterated on in terms of the underlying model and system prompts. The GPT-4o sycophancy debacle only affected the specific model served via the ChatGPT frontend and never impacted the stable snapshots on the API.
I have never seen any sort of compelling evidence that any of the large labs tinkers with their stable, versioned model releases that are served via their API platforms.
- Gemini's free tier will absolutely use your inputs for training [1], same with Mistral's free tier [2]. Anthropic and OpenAI let you opt in to data collection in exchange for discounted prices or free tokens.
- Thoughts used to be available in the Gemini/Vertex APIs when Gemini 2.0 Flash Thinking Experimental was initially introduced [1][2], and were subsequently disabled for the public (I assume hidden behind a visibility flag) shortly after DeepSeek R1's release [3], regardless of the `include_thoughts` setting.
At ~10:15 AM UTC on 04 May, a change was rolled out to the Vertex API (but not the Gemini API) that caused the API to respect the `include_thoughts` setting and return the thoughts. For consumers that had specified `include_thoughts = true` but don't handle thought parts correctly, the thinking traces then leaked into responses (sketch after the references).
[1]: https://googleapis.github.io/python-genai/genai.html#genai.t...
[2]: https://ai.google.dev/api/generate-content#ThinkingConfig
[3]: https://github.com/googleapis/python-genai/blob/157b16b8df40...
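A sketch of what such a consumer looks like with the google-genai SDK; the model name is my assumption from that era, and the point is the `part.thought` flag that a consumer must check:
```python
# Sketch: request thought summaries via include_thoughts. Parts flagged
# with part.thought == True are thinking traces; a consumer that ignores
# the flag will mix thoughts into the visible answer.
from google import genai
from google.genai import types

client = genai.Client()  # reads GOOGLE_API_KEY from the environment
response = client.models.generate_content(
    model="gemini-2.0-flash-thinking-exp",  # assumed model name
    contents="What is 17 * 23?",
    config=types.GenerateContentConfig(
        thinking_config=types.ThinkingConfig(include_thoughts=True)
    ),
)
for part in response.candidates[0].content.parts:
    prefix = "[thought] " if part.thought else ""
    print(prefix + (part.text or ""))
```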
- > You can’t just attach an image to your request.
You can? Google limits HTTP requests to 20MB, but both the Gemini API and the Vertex AI API support embedded base64-encoded files and public URLs. The Gemini API additionally supports attaching files uploaded to its Files API, and the Vertex AI API supports files uploaded to Google Cloud Storage.
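A sketch of the inline case with the google-genai SDK; the file path is a placeholder, and anything over the 20MB request cap should go through the Files API or GCS instead:
```python
# Sketch: attach an image inline to a Gemini request. The SDK handles the
# base64 encoding; the file path here is a placeholder.
from google import genai
from google.genai import types

client = genai.Client()
with open("photo.png", "rb") as f:
    image_bytes = f.read()

response = client.models.generate_content(
    model="gemini-2.0-flash",
    contents=[
        types.Part.from_bytes(data=image_bytes, mime_type="image/png"),
        "Describe this image.",
    ],
)
print(response.text)
```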
- Their primary business nowadays is as an advertising agency, not book sales: https://www.guinnessworldrecords.com/business-marketing-solu...
- It's simple enough to test the tokenizer to determine the base model in use (DeepSeek V3, or a Llama 3/Qwen 2.5 distill).
Using the text "സ്മാർട്ട്", Qwen 2.5 tokenizes as 10 tokens, Llama 3 as 13, and DeepSeek V3 as 8.
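If you want to reproduce those counts, a quick sketch with Hugging Face tokenizers; the repo IDs are my assumption (any checkpoint shipping the same tokenizer works, and the Llama 3 repo is gated behind a license acceptance):
```python
# Sketch: count how each candidate tokenizer splits the probe string.
# Repo IDs are assumptions; the tokenizer, not the weights, is what matters.
from transformers import AutoTokenizer

probe = "സ്മാർട്ട്"
for repo in ("Qwen/Qwen2.5-7B", "meta-llama/Meta-Llama-3-8B", "deepseek-ai/DeepSeek-V3"):
    tok = AutoTokenizer.from_pretrained(repo)
    print(repo, len(tok.encode(probe, add_special_tokens=False)))
```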
Using DeepSeek's chat frontend, both DeepSeek V3 and R1 return the following response (SSE events edited for brevity):
```
{"content":"സ","type":"text"},"chunk_token_usage":1
{"content":"്മ","type":"text"},"chunk_token_usage":2
{"content":"ാ","type":"text"},"chunk_token_usage":1
{"content":"ർ","type":"text"},"chunk_token_usage":1
{"content":"ട","type":"text"},"chunk_token_usage":1
{"content":"്ട","type":"text"},"chunk_token_usage":1
{"content":"്","type":"text"},"chunk_token_usage":1
```
which totals to 8, as expected for DeepSeek V3's tokenizer.