We are soliciting papers describing algorithms and systems in all aspects of document recognition and retrieval, for offline, online and Web documents. Specifically, we are encouraging papers enlarging the frontiers and dealing for instance with multi-modal information such as online and offline, or speech and handwriting, etc.
Recognizing handwritten or degraded machine printed documents (e.g. faxed or historical documents) remains a challenging problem. Document recognition and understanding are concerned with fully reconstructing a document in electronic form consistent with the original format (fonts, layout etc.) and capturing the underlying logical structure (e.g. section headings, region types, references, and reading order) of the document. Among the challenges for machine-printed documents are complex layouts (text written on images, complex backgrounds, etc.), degraded and noisy documents, and robust recognition of tables and equations. Handwritten documents with unconstrained writing style pose additional challenges due to increased variability and segmentation ambiguities. Non-textual elements in documents form another class of interesting problems. These include the extraction and recognition of logos and signatures, and the conversion of line drawings in documents from raster to vector format. Web documents pose both similar and new challenges.
One of the primary reasons for digitizing existing paper materials is to simplify retrieval and organization of information. In this regard we are particularly interested in papers which address any of the following issues: retrieval in the presence of noise; retrieval based on sketches, images, tables, diagrams or other non-linguistic objects that appear in the document; retrieval based on text appearing with non-standard alignment, in images or graphics; recognition and tagging of mathematical arrays and equations which serve as indicators of subject content or methodology used in the document; novel methods for retrieval and organization of information based on text or other information in a document. Papers addressing retrieval-specific issues are encouraged to use standard performance metrics such as ROC and precision-recall curves.
Topics:
Papers are solicited in, but not limited to, the
following areas:
Document Recognition
- Text recognition: machine-printed, handwritten documents; paper, tablet, camera, video sources
- Writer/style identification, verification,
adaptation
- Graphics recognition: vectorization (e.g. for
line-art, maps and technical drawings), signature, logo and graphical
symbol recognition, figure, chart and graph recognition, diagrammatic
notations (e.g. music, mathematical notation)
- Document layout analysis and understanding:
document and page region segmentation, form and table recognition,
document understanding through combined modalities (e.g. speech and
images)
- Evaluation: performance metrics, document degradation models
- Additional topics: document image filtering,
enhancement and compression, document clustering and classification,
machine learning (e.g. integration and optimization of recognition
modules), historical and degraded document images (e.g. fax), multilingual
document recognition, web page analysis (including wikis and blogs)
Document Retrieval
- Indexing and Summarization : (noisy) text
documents (messages, blogs, etc.), imaged documents, entity tagging from
OCR'ed text, text categorization
- Query Languages and Modalities: Content-Based
Image Retrieval (CBIR) for documents, keyword spotting, non-textual
query-by-example (e.g. tables, figures, math), querying by document
geometry and/or logical structure, approximate string matching algorithms
for OCR'ed text, retrieval of noisy text documents (messages, blogs,
etc.), cross and multi- lingual retrieval
- Evaluation: relevance and performance
metrics, evaluation protocols, benchmarking
- Additional topics: relevance feedback, impact
of recognition accuracy on retrieval performance, and digital libraries
including systems engineering and quality assurance
Program Committee:
Conference
Chairs:
Christian Viard-Gaudin, Univ. of Nantes (France);
Richard Zanibbi, Rochester Institute of Technology (USA)
Program
Committee:
Gady Agam, Illinois Institute of Technology;
Elisa Barney Smith, Boise State Univ.;
Bill Barrett, Brigham Young Univ.;
Kathrin Berkner, Ricoh Innovations, Inc.;
Bertrand Couasnon, IRISA/INSA Rennes (France);
Hervé Dejean, Xerox Research (France);
Xiaoqing Ding, Tsinghua Univ. (China);
David Doermann, Univ.of Maryland/College Park;
Oleg Golubitsky, Google, Inc.;
Jianying Hu, IBM Thomas J. Watson Research Ctr.;
Laurence Likforman-Sulem, Telecom ParisTech (France);
Marcus Liwicki, DFKI (Germany);
Xiaofan Lin, Vobile Inc;
Daniel Lopresti, Lehigh Univ.;
Hiroshi Sako, Hitachi,Ltd. (Japan);
Sargur Srihari, Univ. at Buffalo;
Venkata Subramaniam, IBM India Research Lab. (India);
Kazem Taghva, Univ. of Nevada/Las Vegas;
George Thoma, National Library of Medicine;
Berrin Yanikoglu, Sabanci Univ. (Turkey);
Jie Zou, National Library of Medicine
Submission information:
Submissions
to Document Recognition and Retrieval XIX should contain final papers (minimum: 6 pages, maximum: 8 pages).
Submissions should be informative, describe the problem that is
addressed by the paper, the original contribution in the paper, the way it
relates to existing work, and provide experimental/theoretical evaluation.
Final manuscripts to be published in the proceedings are expected to be at
least 6 pages long. Submissions
should be made through the Electronic Imaging
website. In addition to the paper itself, a 200 word summary will be required.
Full-time students that are the primary authors of
a paper and that wish to be considered for the Best Student Paper Award should
consult the competition rules.
Questions concerning the conference could be addressed to: ![]()
Important dates:
July 11, 2011:
July 29, 2011
Submission deadline
September 12, 2011:
Official decisions
November 14, 2011:
Final manuscripts due
Links:
- Preliminary program
- Final program
- Submission website
- SPIE conference website
- Call for papers (A4 pdf)
- Best student paper competition
- Latex templates
- Past conferences
- Past
proceedings
- DBLP
index
- Electronic imaging
News:
Invited speakers :