Recognition and quality assessment of data charts in mixed-mode documents

Sudhindra Shukla, Ashok K Samal

Research output: Contribution to journal › Article

2 Citations (Scopus)

Abstract

Data charts can be used to effectively compress large amounts of complex information and can convey information in an efficient and succinct manner. It is now easier to create data charts by using a variety of automated software systems. These data charts are routinely inserted in text documents and are widely disseminated over many different media. This study addresses the problem of finding goodness of data charts in mixed-mode documents. The quality of the graphics can be used to assist the document development process as well as to serve as an additional criterion for search engines like Google and Yahoo. The quality measures are motivated by principles of visual learning and are based on research in educational psychology and cognitive theories and use attributes of both the graphic and its textual context. We have implemented the approach and evaluated its effectiveness using a set of documents compiled from the Web. Results of a human study show that the proposed quality measures have a high correlation with the quality ratings of the users for each of the five classes of data charts studied in this research.

Original language: English (US)
Pages (from-to): 111-126
Number of pages: 16
Journal: International Journal on Document Analysis and Recognition
Volume: 11
Issue number: 3
DOI: 10.1007/s10032-008-0065-5
State: Published - Oct 3 2008

Fingerprint

Search engines

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Computer Science Applications

Cite this

Recognition and quality assessment of data charts in mixed-mode documents. / Shukla, Sudhindra; Samal, Ashok K.

In: International Journal on Document Analysis and Recognition, Vol. 11, No. 3, 03.10.2008, p. 111-126.

@article{44cde81a6a364c1283729a55692bfa70,
title = "Recognition and quality assessment of data charts in mixed-mode documents",
abstract = "Data charts can be used to effectively compress large amounts of complex information and can convey information in an efficient and succinct manner. It is now easier to create data charts by using a variety of automated software systems. These data charts are routinely inserted in text documents and are widely disseminated over many different media. This study addresses the problem of finding goodness of data charts in mixed-mode documents. The quality of the graphics can be used to assist the document development process as well as to serve as an additional criterion for search engines like Google and Yahoo. The quality measures are motivated by principles of visual learning and are based on research in educational psychology and cognitive theories and use attributes of both the graphic and its textual context. We have implemented the approach and evaluated its effectiveness using a set of documents compiled from the Web. Results of a human study shows that the proposed quality measures have a high correlation with the quality ratings of the users for each of the five classes of data charts studied in this research.",
author = "Shukla, Sudhindra and Samal, {Ashok K}",
year = "2008",
month = "10",
day = "3",
doi = "10.1007/s10032-008-0065-5",
language = "English (US)",
volume = "11",
pages = "111--126",
journal = "International Journal on Document Analysis and Recognition",
issn = "1433-2833",
publisher = "Springer Verlag",
number = "3"
}

TY - JOUR

T1 - Recognition and quality assessment of data charts in mixed-mode documents

AU - Shukla, Sudhindra

AU - Samal, Ashok K

PY - 2008/10/3

Y1 - 2008/10/3

N2 - Data charts can be used to effectively compress large amounts of complex information and can convey information in an efficient and succinct manner. It is now easier to create data charts by using a variety of automated software systems. These data charts are routinely inserted in text documents and are widely disseminated over many different media. This study addresses the problem of finding goodness of data charts in mixed-mode documents. The quality of the graphics can be used to assist the document development process as well as to serve as an additional criterion for search engines like Google and Yahoo. The quality measures are motivated by principles of visual learning and are based on research in educational psychology and cognitive theories and use attributes of both the graphic and its textual context. We have implemented the approach and evaluated its effectiveness using a set of documents compiled from the Web. Results of a human study shows that the proposed quality measures have a high correlation with the quality ratings of the users for each of the five classes of data charts studied in this research.

UR - http://www.scopus.com/inward/record.url?scp=57349187934&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=57349187934&partnerID=8YFLogxK

U2 - 10.1007/s10032-008-0065-5

DO - 10.1007/s10032-008-0065-5

M3 - Article

AN - SCOPUS:57349187934

VL - 11

SP - 111

EP - 126

JO - International Journal on Document Analysis and Recognition

JF - International Journal on Document Analysis and Recognition

SN - 1433-2833

IS - 3

ER -