haley.io
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
dynomight@lemmy.worldM to dynomight internet forum@lemmy.worldEnglish · 2 个月前

The Extreme Inefficiency of RL for Frontier Models

www.tobyord.com

external-link
message-square
0
fedilink
2
external-link

The Extreme Inefficiency of RL for Frontier Models

www.tobyord.com

dynomight@lemmy.worldM to dynomight internet forum@lemmy.worldEnglish · 2 个月前
message-square
0
fedilink
The Extreme Inefficiency of RL for Frontier Models — Toby Ord
www.tobyord.com
external-link
The new scaling paradigm for AI reduces the amount of information a model could learn per hour of training by a factor of 1,000 to 1,000,000. I explore what this means and its implications for scaling.
alert-triangle
You must log in or register to comment.

dynomight internet forum@lemmy.world

dynomight@lemmy.world

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !dynomight@lemmy.world

dynomight internet forum

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 2 users / day
  • 2 users / week
  • 5 users / month
  • 92 users / 6 months
  • 1 local subscriber
  • 87 subscribers
  • 84 Posts
  • 104 Comments
  • Modlog
  • mods:
  • dynomight@lemmy.world
  • BE: 0.19.5
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org