All
Search
Images
Videos
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
論文紹介:Direct Preference Optimization: Your Language Mod
…
Aug 19, 2024
speakerdeck.com
0:16
Audisi Photo Catalog Fashion Juni 2025: Daftar Sekarang!
1.5K views
5 months ago
TikTok
modelphotocatalogfashion
5:02
11K views · 1.2K reactions | The journey is the reward. As long as
…
11K views
2 weeks ago
Facebook
Steve Kaufmann
59:37
The Evolution of LLM Preference Optimization • Guest Lecture at BI
…
26 views
1 month ago
YouTube
Aman Chadha
21:06
6기 논문 리뷰 📎 DPO(2024.06) Direct Preference Optimization: Your Lan
…
1 views
2 months ago
YouTube
KMU X:AI
7:55
[Paper Review] DPO : Your language model is secretly a reward model
5 views
2 months ago
YouTube
LOADING_
20:06
6기 논문 리뷰 📎 DPO(2024.06) Direct Preference Optimization: Your Lan
…
1 views
2 months ago
YouTube
KMU X:AI
6:46
Aligning LLMs: Preference Tuning. RLHF, Reward modeling, Reinforc
…
2 weeks ago
YouTube
AI Podcast Series. Byte Goose AI.
0:07
𝗥𝗼𝘀𝗲 𝗣𝗮𝘁𝗶𝗻𝗶𝗼𝘁𝗶𝘀 | 𝗧𝗵𝗲 𝗜𝗱𝗲𝗻𝘁𝗶𝘁𝘆 𝗔𝗿𝗰𝗵𝗶𝘁𝗲𝗰𝘁 on Instagram: "✨ Feeling stuck? Here’s how to get moving
…
7.1K views
2 weeks ago
Instagram
innermastery360
1:05
DeepLearning.AI on Instagram: "Our course recommendation of the da
…
4.8K views
1 month ago
Instagram
deeplearningai
1:12
Varun Mayya on Instagram: "Google might have secretly dropped an A
…
860.1K views
3 months ago
Instagram
thevarunmayya
19:38
Reinforcement Learning, RLHF, & DPO Explained
13.3K views
Jun 12, 2024
YouTube
Mark Hennings
LLMs | Alignment of Language Models: Reward Maximization-I | L
…
1.6K views
Sep 20, 2024
YouTube
LCS2
31:03
Direct Preference Optimization Your Language Model is Secretly a Rew
…
584 views
Jun 20, 2023
YouTube
mardin mardin
8:54
Direct Preference Optimization: Your Language Model is Secretly
…
37.5K views
Dec 22, 2023
YouTube
AI Coffee Break with Letitia
Direct Preference Optimization is one of the most significant advanc
…
4.8K views
Jan 26, 2024
TikTok
rajistics
14:28
Markov Decision Process (MDP) Tutorial
119.8K views
Dec 16, 2012
YouTube
José Vidal (José M Vidal)
1:17:10
Introduction to Total Rewards
6.5K views
Jul 1, 2020
YouTube
GreggU
1:01
Nasion Patriotik on Instagram: "Model internasional lur #Nasionp"
762.2K views
3 months ago
Instagram
nasionp
9:38
Maya Tutorial: Model a Coffee Cup
277.1K views
Apr 4, 2021
YouTube
What Make Art
6:31
How Habits Can Change Your Life (and Your Brain)
1.1M views
Aug 28, 2018
YouTube
Be Smart
17:06
How to Change your System Language completely in Windows
…
637.4K views
Jan 13, 2017
YouTube
vSAM
7:49
LM part of the IS-LM model | Macroeconomics | Khan Academy
781.2K views
Apr 11, 2012
YouTube
Khan Academy
6:25
11 Body Language Signs She's Attracted To You - HIDDEN Signal
…
7.8M views
Jan 30, 2018
YouTube
MantelligenceDating
1:23
How to check laptop model | Laptop model number check
916.2K views
Aug 22, 2020
YouTube
Open Box Tech
7:44
How Top Model Anok Yai Gets Runway Ready | Diary of a Model
…
4.9M views
Sep 11, 2019
YouTube
Vogue
22:58
What is Financial Modeling? Explanation & Setup of a Financia
…
202.7K views
May 11, 2021
YouTube
Eric Andrews
27:35
Deepseek r1 (prepare) - RLHF & PPO & GRPO
411 views
5 months ago
YouTube
酸果酿
37:38
AI Agents 6 - Memory, Learning, and Adapation
157.8K views
1 month ago
YouTube
Prof. Ghassemi Lectures and Tutorials
14:15
Direct Preference Optimization
772 views
Apr 9, 2024
YouTube
Data Science Gems
See more videos
More like this
Feedback