• 2 Posts
  • 28 Comments
Joined 1 year ago
Cake day: June 10th, 2023

  • All the latest models are trained on synthetic data generated by GPT-4, even the newer versions of GPT-4 itself. OpenAI realized this too late and had to edit their license after Claude was launched. Human-generated data could only get us so far: the recent Phi-3 models, which perform very well for their size (about 3B parameters), only manage that because of AI-generated synthetic data.

    I didn’t read the paper you mentioned, but recent LLMs have progressed a lot, not just on benchmarks but also when evaluated by real humans.

  • The real question is whether it’s legal. Technically it’s possible with current tech. If I were building such a tool, I’d take the ebook and pass it through an LLM (a 7B open-source model would probably do) to tag which character says each line. Once the dialogue is tagged, I could pass it through ElevenLabs or an open-source TTS and voilà, you have an audiobook with different voices.

    The real problem is that open-source TTS isn’t as good yet, and I imagine that with paid services you’ll run into legal issues or it might get too expensive. And can you sell your audiobook? Legal trouble again.

    But if you just wanna do it while sailing the high seas, everything should be possible.
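
    The pipeline described above (tag who says what, then give each speaker a voice) can be roughly sketched like this. It is only a sketch under loud assumptions: the regex in `tag_dialogue` is a crude stand-in for the LLM tagging step, the voice names are made up, and no real TTS service is called. The output is just a list of (voice, text) pairs you would hand to whatever TTS you pick.

```python
import re
from dataclasses import dataclass

# Hypothetical voice names; a real TTS engine has its own voice IDs.
NARRATOR_VOICE = "narrator_voice"
CHARACTER_VOICES = ["voice_a", "voice_b", "voice_c"]

# Matches a quoted line, optionally followed by `said/asked/replied Name`.
QUOTE = re.compile(
    r'"([^"]+)"(?:\s*(?:said|asked|replied)\s+([A-Z][a-z]+)[.,]?)?'
)


@dataclass
class Segment:
    speaker: str
    text: str


def tag_dialogue(text: str) -> list[Segment]:
    """Stand-in for the LLM step: split narration from quoted speech."""
    segments = []
    pos = 0
    for m in QUOTE.finditer(text):
        narration = text[pos:m.start()].strip()
        if narration:
            segments.append(Segment("narrator", narration))
        # The attribution ("asked Alice") is dropped: the voice conveys it.
        segments.append(Segment(m.group(2) or "unknown", m.group(1)))
        pos = m.end()
    tail = text[pos:].strip()
    if tail:
        segments.append(Segment("narrator", tail))
    return segments


def assign_voices(segments: list[Segment]) -> dict[str, str]:
    """Give each speaker a voice in order of first appearance, cycling."""
    voices = {"narrator": NARRATOR_VOICE}
    for seg in segments:
        if seg.speaker not in voices:
            idx = (len(voices) - 1) % len(CHARACTER_VOICES)
            voices[seg.speaker] = CHARACTER_VOICES[idx]
    return voices


def build_script(segments: list[Segment]) -> list[tuple[str, str]]:
    """Return (voice, text) pairs ready to be sent to a TTS engine."""
    voices = assign_voices(segments)
    return [(voices[seg.speaker], seg.text) for seg in segments]


sample = '"Where are we going?" asked Alice. The road was empty. "Home," said Bob.'
script = build_script(tag_dialogue(sample))
```

    In practice the tagging step is where a real model earns its keep, since most dialogue in a novel has no explicit "said X" next to it.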

  • I kinda got lost making that list (just Asperger’s things) and listed every model I knew.

    For a layperson, yeah, self-hosting isn’t as effective yet. But for someone who studies AI (like me), self-hosting is a must. Some use cases are:

    Retrain on your own data (big market potential)

    Make your own bots with specific applications/use cases (like parsing Wikipedia before answering)

    Bypass censorship (funny story, my friend asked Claude to summarize a book on dystopia and it kept telling her to talk about something else cuz dystopias were too depressing for Claude)

    I’ve even heard about models that are specialized for just one task like chatting, or logic puzzles

    And lastly, privacy (for nerds like me)
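
    To make the "parse Wikipedia before answering" use case a bit more concrete, here is a minimal sketch of the retrieve-then-prompt idea. Everything in it is an assumption for illustration: `PASSAGES` stands in for text you would have pulled from Wikipedia, the word-overlap scoring is a toy replacement for a proper search index or embeddings, and the resulting prompt would go to whatever self-hosted model you run.

```python
import re
from collections import Counter

# Stand-in for passages fetched from Wikipedia (hypothetical local cache).
PASSAGES = [
    "Paris is the capital of France.",
    "The Nile is a river in Africa.",
]


def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(question: str, passage: str) -> int:
    """Count how many question words also appear in the passage."""
    q = Counter(tokenize(question))
    p = Counter(tokenize(passage))
    return sum(min(q[w], p[w]) for w in q)


def retrieve(question: str, passages: list[str], k: int = 1) -> list[str]:
    """Return the k passages with the highest word overlap."""
    ranked = sorted(passages, key=lambda p: score(question, p), reverse=True)
    return ranked[:k]


def build_prompt(question: str, passages: list[str], k: int = 1) -> str:
    """Put the retrieved context in front of the question."""
    context = "\n".join(retrieve(question, passages, k))
    return (
        "Use the context to answer.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )


prompt = build_prompt("What is the capital of France?", PASSAGES)
```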