# 本章小结

本章深入解析了深度学习的核心原理，从神经网络基础到关键技术，再到主流架构和实践挑战。

## 核心要点回顾

**神经网络基础**

* 人工神经元模拟生物神经元：接收数字 -> 加权求和 -> 激活函数 -> 传递结果
* “深度”就是网络的层数，每一层把信息进行一次抽象
* 激活函数（如 ReLU）引入非线性，让网络能模拟复杂事物
* 亿万个简单神经元连接起来，可以产生“涌现”现象

**深度学习训练**

* 梯度下降：在“黑夜里下山”，沿着下坡方向更新参数
* 学习率：步子太大会震荡，步子太小会太慢，最佳策略是“先快后慢”
* 局部最优陷阱：SGD 和 Adam 等方法通过加“惯性”来跳出小坑

**主流架构**

* CNN（卷积神经网络）：计算机的“眼睛”，专为图像而生
* RNN（循环神经网络）：计算机的“耳朵”，用于序列数据，但记性不好
* Transformer：全能的“大脑”，用注意力机制“一目十行”，是当前的大一统架构

**深度学习的局限性**

* 黑盒问题：无法解释内部推理过程，对关键领域是合规风险
* 能耗问题：训练顶级大模型需要极高电力和算力成本
* 数据饥渴：对海量高质量标注数据极度依赖
* 灾难性遗忘：学习新任务时容易遗忘旧知识
* 幻觉问题：AI 会一本正经地编造假答案

## 下章预告

下一章将专门介绍大语言模型（LLM），深入解析 ChatGPT、Claude 等模型背后的技术原理，从 Transformer 架构到预训练与微调方法，帮助读者理解生成式 AI 的核心技术。

***

> 📝 **发现错误或有改进建议？** 欢迎提交 [Issue](https://github.com/yeasy/ai_beginner_guide/issues) 或 [PR](https://github.com/yeasy/ai_beginner_guide/pulls)。


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://yeasy.gitbook.io/ai_beginner_guide/di-er-bu-fen-he-xin-ji-shu-jie-xi/05_deep_learning/summary.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.