Title page for ETD etd-11142012-040550
|Type of Document
||Guidry, Jamie Allison
||Improving Discourse Structure Identification
||Master of Science in Engineering Science (M.S.E.S.)
||Engineering Science (Interdepartmental Program)
- rhetorical structure theory
- natural language processing
- semantic analysis
|Date of Defense
Rhetorical Structure Theory (Mann et al. 1988), a popular approach for analyzing discourse coherence, suggests that coherent text can be placed into a hierarchical organization of clauses. Identification of a textís rhetorical structure through automatic discourse analysis is a crucial element for many of todayís Natural Language Processing tasks, but no sufficient tool is available. The current state-of -the-art discourse parser, SPADE (Soricut et al. 2003), is limited to parsing discourse within a single sentence. HILDA (Hernault et al. 2010) extends the parsing abilities of SPADE to the document level, but with a decrease in performance.
This study achieved document-level discourse parsing without sacrificing performance. Provided text was already segmented into elementary discourse units, the task of discourse parsing was separated into three steps: structuring, nuclearity labeling, and relation labeling. An algorithm was developed for classifying relation existence, nuclearity, and relation label that improved upon previous methods. New features were explored for all three steps to maintain state-of-the-art performance when parsing at the document-level.
|| Approximate Download Time
| 28.8 Modem
|| 56K Modem
|| ISDN (64 Kb)
|| ISDN (128 Kb)
|| Higher-speed Access
indicates that a file or directory is
accessible from the LSU campus network only.
If you have questions or technical
problems, please Contact LSU-ETD Support.