面向特定领域的中文对话系统-毕业论文网

论文总字数：29040字

摘要

对话系统是人工智能领域的重要研究课题之一。长期以来，人们一直致力于研究如何赋予机器模仿人类交流的能力，打造更加自然而且智能的人机交互方式。传统的对话系统在实现时依赖大量人工设计的因素，由此带来的巨大开销阻碍了它的大规模应用。本文结合深度学习技术和自然语言处理技术，研究设计一个数据驱动的、面向特定领域的中文对话系统，用于完成餐厅查询的任务。

面向特定领域的对话系统的工作是一个从识别用户意图、跟踪对话状态，最终生成自然语言回复的过程。本文借鉴和实现了剑桥大学对话系统研究组提出的端到端对话模型，使用卷积神经网络对输入文本进行特征提取，从而获取对话状态；在编码器-解码器框架的基础上进行改进，由输入文本的编码结果和对话状态共同解码生成回复。模型设计摒弃了传统实现方案的分模块结构，采用端到端的结构，减少在处理过程中的人工干预，给予系统更多空间发挥其学习能力，进而提高系统整体的准确性和鲁棒性。

对话模型在带有标注的餐厅查询对话语料上进行训练，基于对话模型实现了一个面向餐厅查询任务的对话系统。系统能够准确地捕捉用户输入中的查询条件和请求，做出合理的动作决策。最后结合餐厅知识库的查询结果，生成正确连贯的回复，完成用户的查询请求。

关键词：对话系统，数据驱动，端到端，深度学习

Abstract

Dialogue system is one of the major research topics in the field of artificial intelligence. For a long time, people are dedicated to studying how to teach machines to imitate human communication, building up a more natural and intelligent way of human-machine interaction. Traditional domain-specific dialogue system involves a large number of handcrafted factors, the huge cost thus hinders its large-scale application. This paper combines deep learning technology and natural language processing technology to study and design a data-driven domain-specific Chinese dialogue system, which is aimed to accomplish the task of restaurant query.

The work of a domain-specific dialogue system includes identifying user’s intent, tracking the dialogue state, and eventually generating natural language response. An end-to-end dialogue model proposed by the Cambridge university dialogue system research group is referenced and implemented in this paper. In this model, a convolution neural network is applied to perform feature extraction and get the dialogue state. Then, a basic encoder-decoder framework is modified to use both the encoding result of input text and the dialogue state to generate response. As for the design of model, an end-to-end structure is used to take the place of the traditional modular structure. The system is given more space to exert its learning ability and improves the overall accuracy and robustness.

The dialogue model is trained on the labelled corpus of restaurant query, and a dialogue system for restaurant query task is built base on this model. The system can capture the query within user inputs accurately and take reasonable actions. The query results in knowledge base also helps to complete the generated response and fulfills the query task.

Keywords: Dialogue System, Data-driven, End-to-end, Deep Learning

缩略语中英文对照表

缩写	英文全称	中文全称
ATIS	Air Travel Information Service	航空旅行信息服务
BiRNN	Bi-directional RNN	双向循环神经网络
BLEU	Bilingual Evaluation Understudy	双语评估研究（一种评价翻译质量的指标）
CNN	Convolutional Neural Network	卷积神经网络
DNN	Deep Neural Network	深度神经网络
DM	Dialogue Management	对话管理
DST	Dialogue State Tracking	对话状态跟踪
DSTC	Dialogue State Tracking Challenge	对话状态跟踪挑战
GloVe	Global vectors for word representation	全局的词向量表示
GRU	Gated Recurrent Unit	门控循环单元
LSTM	Long-short Term Memory	长短期记忆
MLE	Maximum Likelihood Estimation	最大似然估计
MLP	Multi-layer Perception	多层感知机
MVC	Model-View-Controller	模型-视图-控制器
NLG	Natural Language Generation	自然语言生成
RNN	Recurrent Neural Network	循环神经网络
Seq2Seq	Sequence to Sequence	序列到序列
SLU	Spoken Language Understanding	口语理解
WSGI	Web Server Gateway Interface	服务器网关接口
word2vec	Word to vector	文本向量化

摘要 I

Abstract II

缩略语中英文对照表 III

第一章绪论 1

1.1 课题研究背景与意义 1

1.1.1 背景 1

1.1.2 意义 1

1.2 国内外研究现状 2

1.2.1 基于规则的对话系统 2

1.2.2 基于检索的对话系统 2

1.2.3 基于框架的对话系统 2

1.2.4 基于状态的对话系统 2

1.3 本文的主要工作 3

第二章对话系统背景知识 4

2.1 对话系统中的深度学习技术 4

2.1.1 自然语言的数字表示 4

2.1.2 循环神经网络 4

2.1.3 长短期记忆 5

2.1.4 编码器-解码器框架 6

2.1.5 注意力机制 6

2.1.6 卷积神经网络 7

2.2 面向特定领域的对话系统简介 8

2.3 分模块的对话模型 8

2.4 端到端的对话模型 9

2.5 本章小结 9

第三章一个面向特定领域的对话模型 10

3.1 任务选择与语料获取 10

3.2 数据处理 11

3.3 对话模型设计 12

3.3.1 对话模型概述 12

3.3.2 对话状态跟踪模型 13

3.3.3 生成式对话模型 14

3.4 对话模型训练 16

3.5 对话模型评估 16

3.6 本章小结 18

第四章面向餐厅查询的对话系统设计与实现 19

4.1 对话系统设计概述 19

4.1.1 任务与目标 19

4.1.2 条件与限制 19

4.2 系统需求分析 19

剩余内容已隐藏，请支付后下载全文，论文总字数：29040字

您需要先支付 80元 才能查看全部内容！立即支付

该课题毕业论文、开题报告、外文翻译、程序设计、图纸设计等资料可联系客服协助查找;

注册

找回密码

面向特定领域的中文对话系统

您可能感兴趣的文章

登录

您可能感兴趣的文章