Reinforcement Learning with Human Feedback (RLHF) - How to train and fine-tune Transformer Models
15:31
30.2K views · Feb 12, 2024
YouTube · Serrano.Academy
[Introduction to Generative AI 2024] Lecture 8: How Large Language Models Are Trained, Stage 3: Learning in Practice and Polishing Skills (Reinforcement Learning from Human Feedback, RLHF)
36:59
78.2K views · Apr 12, 2024
YouTube · Hung-yi Lee
Reinforcement Learning from Human Feedback (RLHF) Explained
11:29
69.1K views · Aug 7, 2024
YouTube · IBM Technology
Reinforcement Learning, RLHF, & DPO Explained
19:39
13.3K views · Jun 12, 2024
YouTube · Mark Hennings
Fine-tuning LLMs on Human Feedback (RLHF + DPO)
28:53
18.1K views · 9 months ago
YouTube · Shaw Talebi
Reinforcement Learning with Human Feedback (RLHF) in 4 minutes
4:06
10.5K views · 10 months ago
YouTube · Sebastian Raschka
RLHF Visualizer | Hands-on Reinforcement Learning
45:51
2.8K views · 2 months ago
YouTube · Vizuara
RLHF from scratch, step-by-step, in code
3:14:37
129 views · 5 months ago
YouTube · Ashwani Kumar
[6-Hour Tutorial] Complete Hands-on LLM Course: The Full Pipeline from Transformer to RLHF
6:06:21
2.9K views · 2 months ago
bilibili · AIDeepCoder
RLHF Explained & Coded (feat. PPO)
1:18:00
153 views · 3 months ago
YouTube · AIArchives