NetNews Usenet Archive 1992 #31

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1992 #31 / NN_1992_31.iso / spool / rec / arts / intfict / 1127 < prev next >

Wrap

Text File | 1992-12-22 | 3.3 KB | 71 lines

Newsgroups: rec.arts.int-fiction Path: sparky!uunet!paladin.american.edu!darwin.sura.net!udel!rochester!cantaloupe.srv.cs.cmu.edu!crabapple.srv.cs.cmu.edu!mkant From: mkant+@cs.cmu.edu (Mark Kantrowitz) Subject: Re: Text Generation Similar to Code Generation Message-ID: <BzMwFJ.JtD.1@cs.cmu.edu> Sender: news@cs.cmu.edu (Usenet News System) Nntp-Posting-Host: glinda.oz.cs.cmu.edu Organization: School of Computer Science, Carnegie Mellon References: <9754280@MVB.SAIC.COM> Date: Tue, 22 Dec 1992 00:09:18 GMT Lines: 58 In article <9754280@MVB.SAIC.COM> Whitten@Fwva.Saic.Com (David Whitten) writes: >As I understand it, a compiler may generate inefficient or ugly code, >but then a second program looks at what it generated and applies some >pattern matching to it. When the output matches specific patterns, >then the machine code is replaced with shorter/faster/better code that >does the same thing. This program is called the peephole optimizer >because it only looks at the code "through a peep hole" and doesn't >really know much about the way it was created, just how it looks on >output. You're coalescing two different concepts. Peephole optimization is a form of bounded lookahead. Many NLG systems that are concerned with text realization can be considered LL(k) transducers. Lookahead is different from rewriting or revising generated text. See for example, Marie Meteer (Vaughan) and David McDonald, "A Model of Revision in Natural Language Generation", ACL-86, pages 90-96, or Marie's more recent paper in Cecile Paris's book. Another paper on this topic is Richard Gabriel's "Deliberate Writing" in McDonald's 1988 book. In it he describes the Yh system, which uses critics and repeated editing to improve the generated text. >I'm sure the work being done by the OZ researchers has considered, or is >considering something similar to this. Yes, this is one of the ideas involved in the GLINDA natural language generation system. See my INLGWS92 paper in the list Peter posted for details. > The problem comes from recognizing 'writing style' enough to make a > collection of patterns that generates writing with a consistent style. True, that's one of the reasons why style is a difficult topic. Just coming up with a good definition of "style" is hard. >Of course, anything more than a simple syntactic level style is a big >problem. >You see mate, I kin tawk funnee an' youz kin get a fee-eel fur mein >siinktaktik stiile but th's ain't no how thee same as me wrrrritin' style. > >I think we would be pushing the current state of IF simply to have >artificial personalities in an adventure which just had differing >syntactic styles. I disagree. Syntactic style and lexical style are equally difficult. GLINDA uses the same mechanisms to handle both. It isn't clear to me what you mean by "syntactic style". The example you give is one of dialectal variation. Dialectal variation involves many factors: particular syntactic organization (e.g., your example starts with a vocative and combines several clauses with conjunctions), lexical selection (e.g., use of "mate", "ain't", "no how"), pronunciation (e.g., "tawk" instead of "talk", "writin'", "thee") and so on. Limiting the generator to just the syntactic level of organization would yield stilted sounding text. --mark Oz Interactive Fiction Project