2
points
harisec
Joined 29 karma
- harisecUnfortunatelly, i have the same experience.
- Yes, during training multiple checkpoints are created, you can distill from any checkpoint you want.
- 2 points
- Anybody can try Grok3 on Chatbot Arena (even if you are in Europe). Select Direct Chat and select the model early-grok-3. https://lmarena.ai/
- 2 points
- 3 points
- It doesn’t matter much how many users Bluesky is gaining, it matters how many of them will be using Bluesky in a few months. We will see.
- Congrats, good luck with your new company!
I have one question regarding your ARC Prize competition: The current leader from the leaderboard (MindsAI) seems not to be following the original intention of the competition (fine tune a model with millions of tasks similar with the ARC tasks). IMO this is against the goal/intention of the competition, the goal being to find a novel way to get neural networks to generalize from a few samples. You can solve almost anything by brute-forcing it (fine tunning on millions of samples). If you agree with me, why is the MindsAI solution accepted?
- Recraft’s image generation service could leak its internal system prompts due to its unique architecture combining Claude (an AI language model) with a diffusion model. Unlike other image generators, Recraft could perform calculations and answer questions, which led to the discovery that carefully crafted prompts could expose the system’s internal instructions.
- 1 point
- 1 point
- Actually, i think it will take less than 2 years. I've been using Aider + Claude 3.5 Sonnet almost daily for a long time and the progress is very fast. We will see.
- That's a good point and I agree with you. However, would you agree that in a few years we will need far less developers than we need right now?
- These are toys but in 2 years they will probably be full projects and 2 years later people will ask "why do i need a software developer?"
- If you want to really get depressed about the future of software developers try aider.
- I never had this problem but i guess it depends on the prompt.
- Qwen 2.5 models are better than Llama and Mistral.