🏠 Beranda
Benchmark
📊 Semua Benchmark 🦖 Dinosaurus v1 🦖 Dinosaurus v2 ✅ Aplikasi To-Do List 🎨 Halaman Bebas Kreatif 🎯 FSACB - Showcase Utama 🌍 Benchmark Terjemahan
Model
🏆 Top 10 Model 🆓 Model Gratis 📋 Semua Model ⚙️ Kilo Code
Sumber Daya
💬 Perpustakaan Prompt 📖 Glosarium AI 🔗 Tautan Berguna

Glosarium AI

Kamus lengkap Kecerdasan Buatan

162
kategori
2.032
subkategori
23.060
istilah
📂
subkategori

Stochastic Markov Decision Processes

MDP where transitions and rewards follow probabilistic distributions, modeling environmental uncertainty.

17 istilah
📂
subkategori

Monte Carlo Methods in RL

Algorithms using repeated random sampling to estimate state-action values in stochastic environments.

14 istilah
📂
subkategori

Stochastic Policies

Strategies returning probability distributions over actions rather than deterministic actions.

11 istilah
📂
subkategori

Bayesian Reinforcement Learning

Approach handling uncertainty over model parameters using probability distributions.

9 istilah
📂
subkategori

Multi-armed Stochastic Bandits

Exploration-exploitation problem where each arm has an unknown stochastic reward distribution.

7 istilah
📂
subkategori

Bootstrap Methods in RL

Techniques using resampling to quantify uncertainty in value estimates.

15 istilah
📂
subkategori

Gaussian Processes for RL

Using Gaussian processes to model uncertainty in the value or transition function.

10 istilah
📂
subkategori

Ensemble Methods in Stochastic RL

Combination of multiple estimators to capture epistemic uncertainty in learning.

19 istilah
📂
subkategori

Distributional Reinforcement Learning

Learning the full distribution of returns rather than only their expected value.

5 istilah
📂
subkategori

Quantile Regression DRL

Specific approach of distributional RL using quantile regression to model uncertainty.

8 istilah
📂
subkategori

Partially Observable Stochastic MDPs

Extension of stochastic MDPs with partial observation, increasing uncertainty about the state.

8 istilah
📂
subkategori

Stochastic Optimization in RL

Optimization methods accounting for noise and uncertainty in gradients and updates.

10 istilah
🔍

Tidak ada hasil ditemukan