1097
Comment:
|
4992
|
Deletions are marked like this. | Additions are marked like this. |
Line 1: | Line 1: |
#acl ElmarHaussmann:read,write All:read | #acl ElmarHaussmann:read,write Sabine Storandt:read,write All:read = Welcome to the Wiki of the seminar ''Information Extraction'' in the winter semester 2013 / 2014 = This seminar will be given by [[http://ad.informatik.uni-freiburg.de/staff/haussmann|Elmar Haußmann]] and [[https://ad.informatik.uni-freiburg.de/staff/storandt|Sabine Storandt]], from the chair of [[http://ad.informatik.uni-freiburg.de/staff/bast|Prof. Dr. Hannah Bast]] (who will also attend the sessions). The seminar will take place every Wednesday, 4:00 pm - 6:00 pm ct., in the seminar room 00-007 in building 106. There will be no session on Wednesday, December 25, 2013, on Wednesday, January 1, 2014 (christmas break) and on Wednesday, October 30, 2013. Therefore, depending on the number of talks/participants, we will have at most 14 sessions. |
Line 3: | Line 5: |
= Welcome to the Wiki of the seminar '''Information Extraction''' in the winter semester 2013 / 2014 = | Information Extraction is the task of extracting (structured) information from natural language text. We will cover interesting topics in this area with a focus on current state-of-the-art systems for Open Information Extraction. We will also cover the NLP background required for understanding. |
Line 5: | Line 7: |
This seminar will be given by [[http://ad.informatik.uni-freiburg.de/staff/haussmann|Elmar Haußmann]] and [[https://ad.informatik.uni-freiburg.de/staff/storandt|Sabine Storandt]], from the chair of [[http://ad.informatik.uni-freiburg.de/staff/bast|Prof. Dr. Hannah Bast]] (who will also attend the sessions). | == Topics == The topics and dates will be assigned in the first sessions. Here is a tentative list of topics and systems (each of which will be roughly explained in the first session): |
Line 7: | Line 10: |
Information Extraction is the task of extracting (structured) information from natural language text. We will cover interesting topics in this area with a focus on current state-of-the-art systems. We will also cover the NLP background required for understanding. |
* NLP Basics 1/2 (POS tagging, chunking, NER, NED), Sven Lieber * NLP Basics 2/2 (constituent / dependency parsing, SRL), Sven Lieber * [[https://www.aaai.org/Papers/IJCAI/2007/IJCAI07-429.pdf|TextRunner]] + [[http://www.sciencedirect.com/science/article/pii/S0004370205000366|KnowItAll]], Haixing Gu * [[http://www.aclweb.org/anthology-new/P/P09/P09-1113.pdf|Distant supervision for RE without labeled data]], Toby Peoples * [[http://ai.cs.washington.edu/www/media/papers/Wu-acl10.pdf|OpenIE using Wikipedia]], Soraya Nikousokhan * [[http://rtw.ml.cmu.edu/rtw/|Never Ending Language Learning]] (Demo / Data available), Lukas Veneziano * [[http://ai.cs.washington.edu/www/media/papers/reverb.pdf|ReVerb]] + [[http://turing.cs.washington.edu/papers/etzioni-ijcai2011.pdf|R2A2]] (Code available), Patrick Notz * [[http://turing.cs.washington.edu/papers/emnlp12-mausam.pdf|OLLIE]] (Code available), Max Lotstein * [[http://www.aclweb.org/anthology-new/D/D12/D12-1104.pdf|PATTY]] (Demo available), Ezequiel Primo * [[http://www.cs.umass.edu/~lmyao/papers/univ-schema-tacl.pdf|Relation Extraction With Matrix Factorization]] (Code available), Licón José * [[http://trec.nist.gov/data/qamain.html|TREC Question Answering Track]], Jennifer Nist * [[http://trec.nist.gov/pubs/trec16/papers/lymba.qa.final.pdf|Lymba (QA System)]], Fabian Schillinger * [[http://www.aaai.org/ojs/index.php/aimagazine/article/view/2303|Watson (QA System)]] + [[http://researcher.watson.ibm.com/researcher/files/us-heq/W(11)%20RELATION%20EXTRACTION%2006177734.pdf|Relation Extraction for DeepQA]], Tobias Paxian * [[http://dl.acm.org/citation.cfm?id=2505677|Robust Question Answering over the Web of Linked Data (QA System)]], Simon Skilevic |
Line 10: | Line 25: |
Here is a tentative list of topics and systems (each of which will roughly explained in the first session): POS tagging, text chunking, entity recognition, semantic role labeling, syntax parsing, Textrunner, Knowitall, SRL-IE, Kraken, R2A2, PATTY, SOFIE, PROSPERA, NELL, Reverb, OLLIE, CSD-IE, ClausIE. The topics and dates will be assigned in the first sessions. |
{{{#!html <span style="color:red">We are no longer assigning new topics.</span> }}} == Sessions == 1. Wednesday October 23, 2013: '''Introduction and Topic Assignment''' [[http://ad-teaching.informatik.uni-freiburg.de/information-extraction-ws1314/session-1.pdf|Slides (Introduction + Organization, Topics)]], [[http://ad-teaching.informatik.uni-freiburg.de/InformationExtractionWS1314/get-video.php?file=session-1|Video Recording]], [[http://youtu.be/Bfk5_Z1T7uo|YouTube]] 1. Wednesday October 30, 2013: '''NO SESSION''' 1. Wednesday November 6, 2013: '''Machine Learning Introduction''' [[http://ad-teaching.informatik.uni-freiburg.de/information-extraction-ws1314/session-2.pptx|Slides]], [[http://ad-teaching.informatik.uni-freiburg.de/InformationExtractionWS1314/get-video.php?file=session-2|Video Recording]], [[http://youtu.be/QdnrNGXL0Ms|YouTube]] 1. Wednesday November 13, 2013: '''NO SESSION''' 1. Wednesday November 20, 2013: '''NLP Basics''' 1. Wednesday November 27, 2013: '''!TextRunner, WOE(OpenIE using Wikipedia)''' 1. Wednesday December 4, 2013: '''Reverb, OLLIE''' 1. Wednesday December 11, 2013: '''NELL, PATTY''' 1. Wednesday December 18, 2013: '''Distant supervision for RE, RE With Matrix Factorization, ''' 1. Wednesday January 8, 2013: '''TREC Question Answering Track, Lymba (QA System), ''' 1. Wednesday January 15, 2013: '''Robust Question Answering over the Web of Linked Data (QA System), Watson (QA System)''' 1. Wednesday January 22, 2013: '''tbd''' 1. Wednesday January 29, 2013: '''tbd''' 1. Wednesday February 5, 2013: '''tbd''' 1. Wednesday February 12, 2013: '''tbd''' Attention: Due to illness or assignment of further topics the dates might be postponed by a week or two. We will inform you about changes via email over the Daphne system (so please register), and update this webpage as soon as possible if any shifts occur. |
Welcome to the Wiki of the seminar ''Information Extraction'' in the winter semester 2013 / 2014
This seminar will be given by Elmar Haußmann and Sabine Storandt, from the chair of Prof. Dr. Hannah Bast (who will also attend the sessions). The seminar will take place every Wednesday, 4:00 pm - 6:00 pm ct., in the seminar room 00-007 in building 106. There will be no session on Wednesday, December 25, 2013, on Wednesday, January 1, 2014 (christmas break) and on Wednesday, October 30, 2013. Therefore, depending on the number of talks/participants, we will have at most 14 sessions.
Information Extraction is the task of extracting (structured) information from natural language text. We will cover interesting topics in this area with a focus on current state-of-the-art systems for Open Information Extraction. We will also cover the NLP background required for understanding.
Topics
The topics and dates will be assigned in the first sessions. Here is a tentative list of topics and systems (each of which will be roughly explained in the first session):
- NLP Basics 1/2 (POS tagging, chunking, NER, NED), Sven Lieber
- NLP Basics 2/2 (constituent / dependency parsing, SRL), Sven Lieber
TextRunner + KnowItAll, Haixing Gu
Distant supervision for RE without labeled data, Toby Peoples
OpenIE using Wikipedia, Soraya Nikousokhan
Never Ending Language Learning (Demo / Data available), Lukas Veneziano
OLLIE (Code available), Max Lotstein
PATTY (Demo available), Ezequiel Primo
Relation Extraction With Matrix Factorization (Code available), Licón José
TREC Question Answering Track, Jennifer Nist
Lymba (QA System), Fabian Schillinger
Watson (QA System) + Relation Extraction for DeepQA, Tobias Paxian
Robust Question Answering over the Web of Linked Data (QA System), Simon Skilevic
We are no longer assigning new topics.
Sessions
Wednesday October 23, 2013: Introduction and Topic Assignment Slides (Introduction + Organization, Topics), Video Recording, YouTube
Wednesday October 30, 2013: NO SESSION
Wednesday November 6, 2013: Machine Learning Introduction Slides, Video Recording, YouTube
Wednesday November 13, 2013: NO SESSION
Wednesday November 20, 2013: NLP Basics
Wednesday November 27, 2013: TextRunner, WOE(OpenIE using Wikipedia)
Wednesday December 4, 2013: Reverb, OLLIE
Wednesday December 11, 2013: NELL, PATTY
Wednesday December 18, 2013: Distant supervision for RE, RE With Matrix Factorization,
Wednesday January 8, 2013: TREC Question Answering Track, Lymba (QA System),
Wednesday January 15, 2013: Robust Question Answering over the Web of Linked Data (QA System), Watson (QA System)
Wednesday January 22, 2013: tbd
Wednesday January 29, 2013: tbd
Wednesday February 5, 2013: tbd
Wednesday February 12, 2013: tbd
Attention: Due to illness or assignment of further topics the dates might be postponed by a week or two. We will inform you about changes via email over the Daphne system (so please register), and update this webpage as soon as possible if any shifts occur.