ankeshanand
250 karma
AI Researcher
https://twitter.com/ankesh_anand
- 2 points
- If you're an individual developer and not an enterprise, just go straight to Google AI Studio or the Gemini API instead: https://aistudio.google.com/app/apikey. It's dead simple to get an API key and call it with a REST client.
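As a rough sketch of what "calling with a REST client" looks like, here is a minimal Python example against the `generateContent` endpoint. The model name `gemini-1.5-pro` and the exact request shape are assumptions based on the public docs; check AI Studio for the current models, and set `GEMINI_API_KEY` in your environment before running.

```python
import json
import os
import urllib.request

# Assumed model name; consult the AI Studio docs for currently available models.
MODEL = "gemini-1.5-pro"
API_KEY = os.environ.get("GEMINI_API_KEY", "")

def build_request(prompt: str):
    """Build the endpoint URL and JSON body for a generateContent call."""
    url = (f"https://generativelanguage.googleapis.com/v1beta/"
           f"models/{MODEL}:generateContent?key={API_KEY}")
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return url, json.dumps(body).encode("utf-8")

if __name__ == "__main__" and API_KEY:
    url, data = build_request("Say hello in one word.")
    req = urllib.request.Request(
        url, data=data, headers={"Content-Type": "application/json"})
    with urllib.request.urlopen(req) as resp:
        reply = json.load(resp)
    # The first candidate's text lives under candidates[0].content.parts.
    print(reply["candidates"][0]["content"]["parts"][0]["text"])
```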
- We've done extensive comparisons against GPT-4V for video inputs in our technical report: https://storage.googleapis.com/deepmind-media/gemini/gemini_....
Most notably, at 1 FPS the GPT-4V API errors out at around 3-4 minutes of video, while 1.5 Pro supports up to an hour of video input.
- Has anyone in this subthread actually read the papers and compared the benchmarks? Llama 2 is behind PaLM 2 on all major benchmarks; they spell this out explicitly in the paper.
- You can also rent a Cloud TPU v4 pod (https://cloud.google.com/tpu), which has 4,096 TPU v4 chips with fast interconnect, amounting to around 1.1 exaflops of compute. It won't be cheap though (in excess of $20M/year, I believe).
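The 1.1-exaflop figure checks out as back-of-the-envelope arithmetic. The per-chip number below (~275 TFLOPS peak bf16 for TPU v4) is my assumption, not something stated in the comment:

```python
# Pod-level compute = chips per pod * peak per-chip throughput.
CHIPS = 4096
PEAK_TFLOPS_PER_CHIP = 275  # assumed TPU v4 bf16 peak; approximate

pod_exaflops = CHIPS * PEAK_TFLOPS_PER_CHIP * 1e12 / 1e18
print(f"{pod_exaflops:.2f} exaflops")  # roughly 1.1 exaflops
```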
- It's important in this context that RL does not have a performance ceiling.
- Looks like any GitHub Pages sites served through Cloudflare are getting blocked; I'm trying out a fix.
- 22 points
- Yep, Karpathy has mentioned this multiple times in his AI talks.
- If you carefully curate who you follow, Twitter can be more like a bunch of subreddits, with the added signal of knowing who's posting. So it ends up being a great way to keep up with small communities.
- Sorry if that wasn't clear: I do mention the linear-classification protocol several times in the post. If you want to evaluate performance on a classification task, you have to show labels during evaluation; otherwise it's an impossible task. Note that the encoder is frozen during evaluation, and only a linear classifier is trained on top. Even when evaluated with a limited set of labels (as low as 1%), contrastive pretraining outperforms purely supervised training by a large margin (see Figure 1 in the Data-Efficient CPC paper: https://arxiv.org/abs/1905.09272).
I didn't get the second part, unfortunately. Could you elaborate and clarify whether you're referring to a specific paper?
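The linear-evaluation protocol described above can be sketched as a toy example: a frozen encoder whose weights are never updated, with only a linear classifier trained on top. Everything here (the 2-D data, the fixed encoder weights, the perceptron update) is illustrative and not from the paper:

```python
import random

random.seed(0)

# Toy stand-in for a pretrained encoder: a fixed (frozen) linear map.
# In the actual protocol this would come from contrastive pretraining,
# and it is never updated during evaluation.
W_ENC = [[0.9, -0.2], [0.1, 1.1]]

def encode(x):
    """Frozen encoder: applies the fixed linear map W_ENC to input x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W_ENC]

# Tiny labeled set: class 0 clustered near (-1,-1), class 1 near (+1,+1).
data = [([random.gauss(-1, 0.3), random.gauss(-1, 0.3)], 0) for _ in range(50)]
data += [([random.gauss(1, 0.3), random.gauss(1, 0.3)], 1) for _ in range(50)]

# Linear probe: the ONLY trainable parameters in this evaluation.
w, b, lr = [0.0, 0.0], 0.0, 0.1
for _ in range(200):
    for x, y in data:
        z = encode(x)  # encoder output; W_ENC itself is never touched
        pred = 1 if sum(wi * zi for wi, zi in zip(w, z)) + b > 0 else 0
        err = y - pred  # perceptron-style update on the probe only
        w = [wi + lr * err * zi for wi, zi in zip(w, z)]
        b += lr * err

acc = sum(
    (1 if sum(wi * zi for wi, zi in zip(w, encode(x))) + b > 0 else 0) == y
    for x, y in data) / len(data)
print(f"linear-probe accuracy: {acc:.2f}")
```

The point of the protocol is that a high probe accuracy reflects the quality of the frozen representation, since the classifier itself has almost no capacity.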
- I didn't mean to convey that we should abandon generative self-supervised methods, but I can see how the comparison gives that impression.
Agreed that using them in conjunction would make sense, since generative methods could capture some features better, and vice versa.
- 97 points
- Great piece! You might want to update the article to mention PyTorch Mobile, which was released today: https://pytorch.org/mobile/home/
- There's active research in model-based RL right now that tries to tackle (1) and (2) together.