Abstract
We present a general method for integrating visual components into a multi-modal cognitive system. The integration approach is generic and can combine an arbitrary set of modalities. We illustrate it with a specific instantiation of the architecture schema that focuses on the integration of vision and language: a cognitive system able to collaborate with a human, learn, and display some understanding of its surroundings. As examples of cross-modal interaction, we describe mechanisms for clarification and visual learning.
Original language | English
---|---
Pages | 3140-3147
Number of pages | 8
DOIs |
Publication status | Published - 15 Dec 2009
Event | IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009 (IROS 2009) - Duration: 15 Dec 2009 → …
Conference

Conference | IEEE/RSJ International Conference on Intelligent Robots and Systems, 2009 (IROS 2009)
---|---
Period | 15/12/09 → …
Bibliographical note
Accepted at the 2009 IEEE/RSJ International Conference on Intelligent Robots and Systems.