Preferences

harisec
Joined 29 karma

  1. Unfortunatelly, i have the same experience.
  2. Yes, during training multiple checkpoints are created, you can distill from any checkpoint you want.
  3. Anybody can try Grok3 on Chatbot Arena (even if you are in Europe). Select Direct Chat and select the model early-grok-3. https://lmarena.ai/
  4. It doesn’t matter much how many users Bluesky is gaining, it matters how many of them will be using Bluesky in a few months. We will see.
  5. Congrats, good luck with your new company!

    I have one question regarding your ARC Prize competition: The current leader from the leaderboard (MindsAI) seems not to be following the original intention of the competition (fine tune a model with millions of tasks similar with the ARC tasks). IMO this is against the goal/intention of the competition, the goal being to find a novel way to get neural networks to generalize from a few samples. You can solve almost anything by brute-forcing it (fine tunning on millions of samples). If you agree with me, why is the MindsAI solution accepted?

  6. Recraft’s image generation service could leak its internal system prompts due to its unique architecture combining Claude (an AI language model) with a diffusion model. Unlike other image generators, Recraft could perform calculations and answer questions, which led to the discovery that carefully crafted prompts could expose the system’s internal instructions.
  7. Actually, i think it will take less than 2 years. I've been using Aider + Claude 3.5 Sonnet almost daily for a long time and the progress is very fast. We will see.
  8. That's a good point and I agree with you. However, would you agree that in a few years we will need far less developers than we need right now?
  9. These are toys but in 2 years they will probably be full projects and 2 years later people will ask "why do i need a software developer?"
  10. If you want to really get depressed about the future of software developers try aider.
  11. I never had this problem but i guess it depends on the prompt.
  12. Qwen 2.5 models are better than Llama and Mistral.

This user hasn’t submitted anything.