Preferences

leodriesch
Joined 1,110 karma

  1. That's what I really like about Flux Kontext, it has similar editing capabilities to the multimodal models, but doesn't mess up the details. The editing with gpt-image-1 only really works for complete style changes like "make this ghibli", but not adding glasses to a photorealistic image and have it retain all the details.
  2. I got SAM 1 to work with MPS device on my MacBook Pro M1, don’t know if it works with this one too.
  3. I ran it with 3040x3040px images on my MacBook M1 Pro in about 9 seconds + 200ms or so for the masking.
  4. I’d guess testing hardware is same as training hardware, so A100. If it was on a mobile device they would have definitely said that.
  5. I think it’s fair to leave it out in the on-device model comparison. 3b is much smaller than 8b, it is obviously not going to be as good as llama 3 if they did not make groundbreaking advancements with the technology.
  6. The model is always wrong, since it predicts a propability distribution over all possible tokens, but the target has 100% possibility for one token and 0 for all others.
  7. I am really impressed by the Apple Maps implementation. I think it also uses textured polygons, but does so in a very good looking way and at 120 fps on an iPhone, showing even a whole city in textured 3d.
  8. How does this compare to MLX? As far as I understand MLX is equivalent to PyTorch but optimized for Apple Silicon.

    Is this meant for training MLX models in a distributed manner? Or what is its purpose?

  9. I'd say most are thinking of Midjourneys success in image generation when talking about this kind of progress.
  10. I’d say talent? Outside of OpenAI no team has been able to release a model as capable as GPT-4, and I’m unsure if the CIA has been prioritizing LLM experts in their hiring.
  11. Could you link me to a finetune optimized for function calling? I was looking for one a few weeks ago but did not find any.
  12. The purple icon in the shared transcript indicates GPT-4.
  13. It’s a movie about a humanlike personal AI companion.

    https://m.imdb.com/title/tt1798709/

  14. Their privacy policy lists OpenAI as one of their partners for data processing, which indicates that this is happening not on your device, and data is also shared with third parties.

    For me this is the main counterargument against apps like these. I want to feel free to post any information into this without thinking about who may read or use it.

    Local is the only way to go for software like this in my opinion.

  15. This is just a prompting hack you can use with any LLM, not exclusive to Claude. But I do like the fact that they include these tricks in their documentation.

This user hasn’t submitted anything.

Keyboard Shortcuts

Story Lists

j
Next story
k
Previous story
Shift+j
Last story
Shift+k
First story
o Enter
Go to story URL
c
Go to comments
u
Go to author

Navigation

Shift+t
Go to top stories
Shift+n
Go to new stories
Shift+b
Go to best stories
Shift+a
Go to Ask HN
Shift+s
Go to Show HN

Miscellaneous

?
Show this modal