This is the work done by me and my friend (Behera) as our undergraduate project. Being from the Electrical department and doing the final year project in the Computer Science department was no easy task. Thankfully, our supervisor Dr Sudeshna Sarkar was very helpful, and the professors from the Electrical department were too ignorant to understand what I was talking about in the presentation. We did a good amount of work, and the work was conferred the best undergraduate project in systems award!
Here goes the abstract from the thesis:
A question answering (QA) system provides direct answers to user questions by consulting its knowledge base. Since the early days of artificial intelligence in the 1960s, researchers have been fascinated with answering natural language questions. However, the difficulty of natural language processing (NLP) has limited the scope of QA to domain-specific expert systems. In recent years, the combination of web growth, improvements in information technology, and the explosive demand for better information access has reignited interest in QA systems. The wealth of information on the web makes it an attractive resource for seeking quick answers to simple, factual questions such as “who was the first American in space?” or “what is the second tallest mountain in the world?” Yet today’s most advanced web search services (e.g., Google, Yahoo, MSN Live Search and AskJeeves) make it surprisingly tedious to locate answers to such questions.
Question answering aims to develop techniques that go beyond the retrieval of relevant documents in order to return exact answers to natural language factoid questions, such as “Who is the first woman to be in space?”, “Which is the largest city in India?”, and “When was first world war fought?”. Answering natural language questions requires more complex processing of text than current information retrieval systems employ.
This thesis investigates a number of techniques for performing open-domain factoid question answering. We have developed an architecture that augments existing search engines so that they support natural language question answering, and that can also use a local corpus as its knowledge base. Our system currently supports document retrieval from Google and Yahoo via their public search engine application programming interfaces (APIs). We assumed that all the information required to produce an answer exists in a single sentence and followed a pipelined approach to the problem. The stages in the pipeline include: automatically constructed question type analysers based on various classifier models, document retrieval, passage extraction, phrase extraction, and sentence and answer ranking. We developed and analyzed different sentence and answer ranking algorithms, ranging from simple ones that employ surface matching text patterns to more complicated ones that use root words, part of speech (POS) tags and sense similarity metrics. The thesis also presents a feasibility analysis of using our system in real-time QA applications.
You can download the thesis here. Everything is written in Java. The source code is huge (~100 MB), so I'm not posting it.
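To give a rough idea of the simplest kind of sentence ranking mentioned in the abstract (surface matching between the question and a candidate sentence), here is a minimal Java sketch. It is not the thesis code: the class name `OverlapSentenceRanker`, the tiny stop-word list and the overlap score are all assumptions made for illustration, and the tokenizer only handles ASCII text.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

/** Illustrative only: ranks candidate sentences by word overlap with the question. */
final class OverlapSentenceRanker {

    // Hypothetical, deliberately tiny stop-word list; not taken from the thesis.
    private static final Set<String> STOP_WORDS = new HashSet<>(Arrays.asList(
            "a", "an", "the", "is", "was", "were", "in", "of", "to", "be",
            "who", "what", "which", "when", "where"));

    /** Lower-cases the text, splits on non-alphanumeric ASCII and drops stop words. */
    static Set<String> contentWords(String text) {
        Set<String> words = new HashSet<>();
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (!token.isEmpty() && !STOP_WORDS.contains(token)) {
                words.add(token);
            }
        }
        return words;
    }

    /** Fraction of the question's content words that also occur in the sentence. */
    static double score(String question, String sentence) {
        Set<String> questionWords = contentWords(question);
        if (questionWords.isEmpty()) {
            return 0.0;
        }
        Set<String> shared = new HashSet<>(questionWords);
        shared.retainAll(contentWords(sentence));
        return (double) shared.size() / questionWords.size();
    }

    /** Returns a new list with the sentences sorted from highest to lowest score. */
    static List<String> rank(String question, List<String> sentences) {
        List<String> ranked = new ArrayList<>(sentences);
        ranked.sort(Comparator.comparingDouble((String s) -> score(question, s)).reversed());
        return ranked;
    }

    public static void main(String[] args) {
        String question = "Who was the first American in space?";
        List<String> candidates = Arrays.asList(
                "Yuri Gagarin was the first human in space.",
                "Alan Shepard was the first American in space.",
                "Mount Everest is the tallest mountain.");
        // Print each candidate with its overlap score, best first.
        for (String sentence : rank(question, candidates)) {
            System.out.println(score(question, sentence) + "  " + sentence);
        }
    }
}
```

A score like this only rewards shared surface words, which is why the abstract moves on to root words, POS tags and sense similarity for the more complicated rankers.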

Comparison of our system with existing ones

  • Duy Le

    Hi Amiya, my name is Duy Le and I am from Vietnam. I’m a student at the University of Sciences, Ho Chi Minh City. Currently, I’m studying Q&A systems and applying QA to Vietnamese. I have read your thesis and it’s really very good. Your work is so impressive to me.
    I have some questions, could you please help?

    In your thesis, you wrote that
    “Figure 2.4: JAVA Question Classifier, can be downloaded for evaluation from
                  http://www.cybergeeks.co.in/projects.php?id=10”
    However, when I accessed the link, it was not found. Can I find this somewhere else?

    I’m coding my system for English first and will then change it to Vietnamese, but I do not have any QA system (in English) as a baseline for comparison. I searched on the Internet and found yours. Could you please give me your executable program so that I can test it and compare it with my results?

    Thanks so much,
    Duy

    • Anonymous

      I’m glad it was of help to you. The links must be outdated, I’ll try to find the code and send it to you. Hope that helps.

      • Duy Le

        Thanks Amiya!
        If you have it, please send it to me. My email is lnbduy@gmail.com
         

  • Duy Le

    Hi Amiya,

    I’m implementing question classification and need an annotated corpus for training/testing. I know that you used the Tagged Question Corpus from the Cognitive Computation Group at the Department of Computer Science, UIUC. I tried to access this corpus, but it asked me to log in with a NetId, which I cannot get. Do you know how to get it from the Internet? If so, could you please provide me the link? If you have it, could you please send it to me?

    Thank you so much!

    Best regards,
    Duy

    • Duy Le

      Hi Amiya,

      I think it is http://cogcomp.cs.illinois.edu/Data/QA/QC/

      Is this right?

      Regards,
      Duy

  • Manivannan D

    Hi Amiya,
    I’m pursuing a B.Tech at Sri Venkateswara University. As my final year project I would like to implement “a factoid question answer system on web”. Now I need to create an interface between my web page and the DBpedia website. Can I get the interface Java code?
