Gen TANG ab3894f709 update readme in english 2 years ago
ch03_linear a75c7456d6 update readme for ch03 2 years ago
ch04_logit e6c0e50630 update comment for ch04 2 years ago
ch05_econometrics 2d95750b98 update comment for ch05 2 years ago
ch06_optimizer 05d6077132 update comment for ch06 2 years ago
ch07_autograd e18835b1cd update comment for ch07 2 years ago
ch08_mlp 35ba0ddcea update comment for ch09 2 years ago
ch09_cnn 35ba0ddcea update comment for ch09 2 years ago
ch10_rnn 35f6d19e21 typo in ch10 readme 2 years ago
ch11_llm 098ee9208e typo in ch11 2 years ago
ch12_rl af872ca8ce typo 2 years ago
ch13_others af872ca8ce typo 2 years ago
.gitignore e18835b1cd update comment for ch07 2 years ago
LICENSE 0a9008828e Initial commit 2 years ago
README.md ab3894f709 update readme in english 2 years ago

The Dance of Probability: A Journey from Models to Language in Artificial Intelligence

This book is currently in the editing process and will be available soon.

Description

For classic AI (artificial intelligence) models, tools such as PyTorch provide well-encapsulated implementations that are relatively straightforward to use. For engineering reasons, however, these implementations carry a great deal of detail that makes the underlying models harder to understand. This book aims to improve reader comprehension by re-implementing the core parts of each model with detailed annotations. Complex algorithms can be hard to explain in natural language, and reading the code is often more intuitive.

The code relies on external libraries; the installation commands are given at the beginning of each script. Rerunning a script may give slightly different results because of random number generation, but the differences are minor. For the large language models, run the code on a GPU; on a CPU, computation time increases significantly.
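
To reduce the run-to-run variation mentioned above, and to check whether a GPU is visible before starting the large language model chapters, something like the following could be placed at the top of a script. This is a minimal sketch, not code from the book: `set_seed` and `pick_device` are hypothetical helper names, the seed value is arbitrary, and PyTorch is treated as optional so that the snippet also runs without it.

```python
import random

try:
    import torch  # optional here; the chapter scripts list their own install commands
except ImportError:
    torch = None


def set_seed(seed: int) -> None:
    """Seed the random number generators so that reruns draw the same numbers.

    Hypothetical helper, not part of the repository.
    """
    random.seed(seed)
    if torch is not None:
        torch.manual_seed(seed)


def pick_device() -> str:
    """Return "cuda" when PyTorch can see a GPU, otherwise "cpu".

    Hypothetical helper, not part of the repository.
    """
    if torch is not None and torch.cuda.is_available():
        return "cuda"
    return "cpu"


# Seeding twice with the same value reproduces the same draws.
set_seed(42)  # 42 is an arbitrary choice
first = [random.random() for _ in range(3)]
set_seed(42)
second = [random.random() for _ in range(3)]
print(first == second)

# Report which device the rest of a script would use.
print(pick_device())
```

Even with fixed seeds, some GPU operations are not fully deterministic, so small differences between runs can remain.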

Outline

- ch03_linear: Linear Regression
- ch04_logit: Logistic Regression
- ch05_econometrics: Insights from Econometrics
- ch06_optimizer: Optimization Algorithms
- ch07_autograd: Backpropagation
- ch08_mlp: Multilayer Perceptron
- ch09_cnn: Convolutional Neural Network
- ch10_rnn: Recurrent Neural Network
- ch11_llm: Large Language Model
- ch12_rl: Reinforcement Learning
- ch13_others: Other Classic Models