International Journal of Advanced Computer Research (IJACR) ISSN (P): 2249-7277 ISSN (O): 2277-7970 Vol - 3, Issue - 9, March 2013
Extracting Person Name, Date and Place from Text Documents Using LEX Tool

Roohi Sharma

Abstract

This paper describes how person names, dates and places can be extracted from a text document using finite state automata and the LEX tool. Searching a text document manually for important information is slow, tedious and error-prone. Regular expressions are used to parse textual data, match patterns and extract variables. The work relies on a lexical analyzer, which scans the input character by character and groups the characters into tokens. The paper presents a technique for identifying and extracting information with the LEX tool: it finds the names, dates and places that appear in a machine-readable text document. The regular expressions used to extract the required information are also discussed.
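The scanning idea summarized above can be illustrated with a minimal sketch. It is written in Python rather than LEX, and the three patterns (a numeric date, a name introduced by a title such as "Dr.", and a capitalized place after "in", "at" or "from") are simplified assumptions chosen for illustration, not the regular expressions used in the paper. The function name `extract` and the token labels are likewise hypothetical.

```python
import re

# Illustrative patterns only (assumptions, not the paper's rules).
# Each alternative plays the role of one LEX rule; the named group is the token type.
TOKEN_RE = re.compile(
    r"(?P<DATE>\b\d{1,2}[/-]\d{1,2}[/-]\d{4}\b)"                              # e.g. 12/03/2013
    r"|(?P<NAME>\b(?:Mrs|Mr|Ms|Dr)\.\s+[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*)"        # title + capitalized words
    r"|\b(?:in|at|from)\s+(?P<PLACE>[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*)"           # capitalized words after a preposition
)

def extract(text):
    """Scan text left to right and return (token_type, lexeme) pairs."""
    tokens = []
    for match in TOKEN_RE.finditer(text):
        kind = match.lastgroup            # name of the group that matched
        tokens.append((kind, match.group(kind)))
    return tokens

if __name__ == "__main__":
    sample = "Dr. Roohi Sharma arrived in New Delhi on 12/03/2013."
    for kind, lexeme in extract(sample):
        print(kind, lexeme)
```

Like a LEX specification, the sketch pairs each pattern with a token type and scans the input once from left to right; text that matches no pattern is skipped. Real documents would need much richer patterns, for example for month names or untitled person names.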

Keywords

Regular Expressions, Finite State Automata, Information Extraction, Pattern Matching, Lexical Analyzer.
