飞桨2.0实例教程 — 使用BERT实现自动写诗 #968

fiyen · 2021-04-17T05:08:46Z

No description provided.

删除了1312-1315行多余代码，训练和评测verbose改为1并重新生成了输出。

删除了原先的位置，增加了预训练词向量文件夹

修改了两处说明（Line9,190），修改了最近修改时间（Line15）

rewrite some discriptions

更新

TCChenlong · 2021-04-26T07:37:00Z

整个文档要分章节整理，这样能够带给读者明确的阅读路线，例如1. 数据处理，2. 模型构建 3. 模型训练， 4 模型评估等等
数据处理部分，最好能画图展示数据处理前后数据的样子，例如tokenizer之前数据的样子和tokenizer后的样子，数据padding后的样子，并且期间最好展示一下bert的输入数据包含哪些，格式是什么。
实验之前最好能介绍下自动写诗的任务是什么样的，介绍数据集是什么样的

补充了对输入的详细说明；将数据集替换为飞桨官方数据集；补充了自动写诗的说明；对章节进行了划分和标序

TCChenlong · 2021-06-09T03:34:08Z

1、不建议将 import paddlenlp as ppnlp 直接 import paddlenlp 即可
2、paddlenlp 发布了2.0版本，有些API的行为发生了变化，需要更新代码以适配新版paddlenlp~

update the newest url for paddlenlp; update the new API for paddlenlp.

merge op of load_dataset for test, dev, and train.

TCChenlong

LGTM

TCChenlong

整体写的很好，有一些表述需要完善下，辛苦~

TCChenlong · 2021-06-18T13:12:57Z

...le2.0_docs/pretrained_bert_for_poetry_generation/pretrained_bert_for_poetry_generation.ipynb

+    "### 3.1 预训练BERT模型\n",
+    "古诗生成是一个文本生成的过程，在实际中模型无法获知还未生成的内容，也即BERT中的双向关系中只能捕捉到前向关系而不能捕捉到后向关系。这个限制我们可以通过添加注意力掩码（attention mask）来屏蔽掉后向的关系，使模型无法注意到还未生成的内容，从而使BERT仍能完成文本生成任务。\n",
+    "\n",
+    "进一步地，我们可以将文本生成简化为基于BERT的词分类模型（理解为词性标注），即赋予每个词一个标签，该标签即该词后的下一个词是什么。因此，我们直接调用PaddleNLP的BERT词分类模型即可看，需注意模型分类的类别为词表长度。"


即赋予每个词一个标签，该标签即该词后的下一个词是什么。
这一句的描述不是很清晰，建议补充一个例子来说明；
因此，我们直接调用PaddleNLP的BERT词分类模型即可看，
这一句似乎不全吧？

TCChenlong · 2021-06-18T13:17:30Z

...le2.0_docs/pretrained_bert_for_poetry_generation/pretrained_bert_for_poetry_generation.ipynb

+   },
+   "source": [
+    "## 4. 古诗生成\n",
+    "以下，我们定义一个类来利用已经训练好的模型完成古诗生成的任务。在生成古诗的过程中，我们将已经生成的内容作为输入，编码后输入模型，得到输入中每个词对应的分类结果。然后选取最后一个词的分类结果作为下一个待预测的词。下一轮中，刚刚预测的词将加入到已生成的内容中，继续进行下一个词的预测。\n",


然后选取最后一个词的分类结果作为下一个待预测的词
这句有点歧义，建议改为：
作为下一个待预测的词 -> 作为根据当前内容要预测的词

TCChenlong · 2021-06-18T13:18:07Z

...le2.0_docs/pretrained_bert_for_poetry_generation/pretrained_bert_for_poetry_generation.ipynb

+    "        self.sequence_length = input_length\r\n",
+    "        self.lower_triangle_mask = paddle.tril(paddle.tensor.full((input_length, input_length), 1, 'float32'))\r\n",
+    "\r\n",
+    "    def forward(self, token, token_type, input_mask, input_length=None):\r\n",


这一段代码可以多加一些注释，帮助读者理解~

add more descriptions.

fiyen added 9 commits November 24, 2020 23:21

飞桨2.0实例教程——使用预训练词向量

abffa64

飞桨2.0实例教程——使用预训练词向量

32778b0

删除了1312-1315行多余代码，训练和评测verbose改为1并重新生成了输出。

Delete pretrained_word_embeddings.ipynb

5c51386

飞桨2.0应用案例——使用预训练的词向量

902af1c

删除了原先的位置，增加了预训练词向量文件夹

Update pretrained_word_embeddings.ipynb

53f9677

修改了两处说明（Line9,190），修改了最近修改时间（Line15）

modified based on the latest comments

d92b89c

Create README.md

88a02f4

Add files via upload

02a7744

Add files via upload

4fc5af6

rewrite some discriptions

fiyen closed this Apr 18, 2021

fiyen added 2 commits April 18, 2021 17:18

Update pretrained_word_embeddings.ipynb

e175b3d

Merge pull request #1 from PaddlePaddle/develop

2748654

更新

fiyen reopened this Apr 18, 2021

Add files via upload

31dd253

补充了对输入的详细说明；将数据集替换为飞桨官方数据集；补充了自动写诗的说明；对章节进行了划分和标序

fiyen added 2 commits June 9, 2021 15:41

Add files via upload

7f25d54

update the newest url for paddlenlp; update the new API for paddlenlp.

Add files via upload

fbec11f

merge op of load_dataset for test, dev, and train.

TCChenlong approved these changes Jun 17, 2021

View reviewed changes

TCChenlong reviewed Jun 18, 2021

View reviewed changes

Add files via upload

9fa7c4c

add more descriptions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

飞桨2.0实例教程 — 使用BERT实现自动写诗 #968

飞桨2.0实例教程 — 使用BERT实现自动写诗 #968

fiyen commented Apr 17, 2021

TCChenlong commented Apr 26, 2021

TCChenlong commented Jun 9, 2021

TCChenlong left a comment

TCChenlong left a comment

TCChenlong Jun 18, 2021

TCChenlong Jun 18, 2021

TCChenlong Jun 18, 2021

飞桨2.0实例教程 — 使用BERT实现自动写诗 #968

Are you sure you want to change the base?

飞桨2.0实例教程 — 使用BERT实现自动写诗 #968

Conversation

fiyen commented Apr 17, 2021

TCChenlong commented Apr 26, 2021

TCChenlong commented Jun 9, 2021

TCChenlong left a comment

Choose a reason for hiding this comment

TCChenlong left a comment

Choose a reason for hiding this comment

TCChenlong Jun 18, 2021

Choose a reason for hiding this comment

TCChenlong Jun 18, 2021

Choose a reason for hiding this comment

TCChenlong Jun 18, 2021

Choose a reason for hiding this comment