Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficiently incorporating user feedback into information extraction and integration programs

X. Chai, B. Vuong, A. Doan, und J. Naughton. Proceedings of the 35th SIGMOD international conference on Management of data, Seite 87--100. New York, NY, USA, ACM, (2009)

Zusammenfassung

Many applications increasingly employ information extraction and integration (IE/II) programs to infer structures from unstructured data. Automatic IE/II are inherently imprecise. Hence such programs often make many IE/II mistakes, and thus can significantly benefit from user feedback. Today, however, there is no good way to automatically provide and process such feedback. When finding an IE/II mistake, users often must alert the developer team (e.g., via email or Web form) about the mistake, and then wait for the team to manually examine the program internals to locate and fix the mistake, a slow, error-prone, and frustrating process. In this paper we propose a solution for users to directly provide feedback and for IE/II programs to automatically process such feedback. In our solution a developer U uses hlog, a declarative IE/II language, to write an IE/II program P. Next, U writes declarative user feedback rules that specify which parts of P's data (e.g., input, intermediate, or output data) users can edit, and via which user interfaces. Next, the so-augmented program P is executed, then enters a loop of waiting for and incorporating user feedback. Given user feedback F on a data portion of P, we show how to automatically propagate F to the rest of P, and to seamlessly combine F with prior user feedback. We describe the syntax and semantics of hlog, a baseline execution strategy, and then various optimization techniques. Finally, we describe experiments with real-world data that demonstrate the promise of our solution.

Links und Ressourcen

URL:

http://doi.acm.org/10.1145/1559845.1559857

BibTeX-Schlüssel:

chai2009efficiently

Suchen auf:

Kommentare und Rezensionen
(0)

Es gibt bisher keine Rezension oder Kommentar. Sie können eine schreiben!

Zitieren Sie diese Publikation

@inproceedings{chai2009efficiently,
 abstract = {Many applications increasingly employ information extraction and integration (IE/II) programs to infer structures from unstructured data. Automatic IE/II are inherently imprecise. Hence such programs often make many IE/II mistakes, and thus can significantly benefit from user feedback. Today, however, there is no good way to automatically provide and process such feedback. When finding an IE/II mistake, users often must alert the developer team (e.g., via email or Web form) about the mistake, and then wait for the team to manually examine the program internals to locate and fix the mistake, a slow, error-prone, and frustrating process. In this paper we propose a solution for users to directly provide feedback and for IE/II programs to automatically process such feedback. In our solution a developer U uses hlog, a declarative IE/II language, to write an IE/II program P. Next, U writes declarative user feedback rules that specify which parts of P's data (e.g., input, intermediate, or output data) users can edit, and via which user interfaces. Next, the so-augmented program P is executed, then enters a loop of waiting for and incorporating user feedback. Given user feedback F on a data portion of P, we show how to automatically propagate F to the rest of P, and to seamlessly combine F with prior user feedback. We describe the syntax and semantics of hlog, a baseline execution strategy, and then various optimization techniques. Finally, we describe experiments with real-world data that demonstrate the promise of our solution.},
 acmid = {1559857},
 added-at = {2012-06-19T17:05:26.000+0200},
 address = {New York, NY, USA},
 author = {Chai, Xiaoyong and Vuong, Ba-Quy and Doan, AnHai and Naughton, Jeffrey F.},
 biburl = {https://puma.uni-kassel.de/bibtex/2d6c9fbf442a935dc0618107f8fb54d44/jaeschke},
 booktitle = {Proceedings of the 35th SIGMOD international conference on Management of data},
 doi = {10.1145/1559845.1559857},
 interhash = {5860215447e374b059597c0e3864e388},
 intrahash = {d6c9fbf442a935dc0618107f8fb54d44},
 isbn = {978-1-60558-551-2},
 keywords = {cirg collective computing extraction human ie information intelligence toread},
 location = {Providence, Rhode Island, USA},
 numpages = {14},
 pages = {87--100},
 publisher = {ACM},
 timestamp = {2012-06-19T17:05:27.000+0200},
 title = {Efficiently incorporating user feedback into information extraction and integration programs},
 url = {http://doi.acm.org/10.1145/1559845.1559857},
 year = 2009
}

%0 Conference Paper
%1 chai2009efficiently
%A Chai, Xiaoyong
%A Vuong, Ba-Quy
%A Doan, AnHai
%A Naughton, Jeffrey F.
%B Proceedings of the 35th SIGMOD international conference on Management of data
%C New York, NY, USA
%D 2009
%I ACM
%K cirg collective computing extraction human ie information intelligence toread
%P 87--100
%R 10.1145/1559845.1559857
%T Efficiently incorporating user feedback into information extraction and integration programs
%U http://doi.acm.org/10.1145/1559845.1559857
%X Many applications increasingly employ information extraction and integration (IE/II) programs to infer structures from unstructured data. Automatic IE/II are inherently imprecise. Hence such programs often make many IE/II mistakes, and thus can significantly benefit from user feedback. Today, however, there is no good way to automatically provide and process such feedback. When finding an IE/II mistake, users often must alert the developer team (e.g., via email or Web form) about the mistake, and then wait for the team to manually examine the program internals to locate and fix the mistake, a slow, error-prone, and frustrating process.</p> <p>In this paper we propose a solution for users to directly provide feedback and for IE/II programs to automatically process such feedback. In our solution a developer <i>U</i> uses hlog, a declarative IE/II language, to write an IE/II program <i>P</i>. Next, <i>U</i> writes declarative user feedback rules that specify which parts of <i>P</i>'s data (e.g., input, intermediate, or output data) users can edit, and via which user interfaces. Next, the so-augmented program <i>P</i> is executed, then enters a loop of waiting for and incorporating user feedback. Given user feedback <i>F</i> on a data portion of <i>P</i>, we show how to automatically propagate <i>F</i> to the rest of <i>P</i>, and to seamlessly combine <i>F</i> with prior user feedback. We describe the syntax and semantics of hlog, a baseline execution strategy, and then various optimization techniques. Finally, we describe experiments with real-world data that demonstrate the promise of our solution.
%@ 978-1-60558-551-2

PUMA

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficiently incorporating user feedback into information extraction and integration programs

Zusammenfassung

Links und Ressourcen

Kommentare und Rezensionen
(0)

Tags

Zitieren Sie diese Publikation

Metadaten

Community

Tags (@jaeschkes Tags hervorgehoben)

PUMA

KopierenLöschenDiese Publikation zur Ablage hinzufügenCommunity-EintragVersionsverlauf dieses EintragsURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Efficiently incorporating user feedback into information extraction and integration programs

Zusammenfassung

Links und Ressourcen

Kommentare und Rezensionen (0)

Tags

Zitieren Sie diese Publikation

Metadaten

Community

Tags (@jaeschkes Tags hervorgehoben)

Kopieren Löschen Diese Publikation zur Ablage hinzufügen
Community-Eintrag
Versionsverlauf dieses Eintrags
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Efficiently incorporating user feedback into information extraction and integration programs

Kommentare und Rezensionen
(0)