ISCANIT: Recognising Intention in Real-Time
for Visually Mediated Interaction

The aim of the ISCANIT project is to undertake research supporting Visually Mediated Interaction (VMI) and to advance the state of the art by developing generic view-based head and body behavioural models. These models will be used to recognise intentions for active camera control. We propose to develop active computer vision systems which perform dynamic scene interpretation in terms of subjects' behaviour and intention to mediate interaction. A prototype system will be built to perform real-time tracking of multiple people and behavioural analysis of several individuals (at most three simultaneously) within typical indoor office or home environments. The main objectives for evaluation of the project are, therefore:

1. The investigation and development of a computational paradigm based upon active cameras that will enable on-line capture and interpretation of dynamic scenes containing live actions, which can also be used to capture the behaviour of users.

2. The development of body movement and gesture models and the use of such models for user-specific interpretation of behaviour and intention.

3. The development of intentional tracking techniques that exploit the behavioural models above by estimating and predicting expected head and body movement and primitive gestures in real-time.

4. The development of switching of attention mechanisms for active tracking of several individuals with fewer cameras than individuals.

Resources

People Involved in the ISCANIT Project

This EPSRC-funded 2-year project is a collaboration between the School of Cognitive and Computing Sciences at the University of Sussex (COGS) and the Department of Computer Science, Queen Mary and Westfield College, University of London.

University of Sussex

QMW

Associates:

University of Dundee

Publications arising out of the ISCANIT Project

2000

1999

1998

Phase I Data: Gesture Image Sequence Database

A major part of the project will be concerned with the collection of a substantial image sequence database.

Phase I: single-person gesture sequences

Example 60-frame 'pointing left' sequence.

As above, after background subtraction.

As above, after frame differencing.

As above, after background subtraction and frame differencing.

Phase II Data: Group Interaction Image Sequence Database

Back to COGS Vision Home Page

Back to QMW Vision Home Page

Back to COGS Home Page

24 August 2000
Jonathan Howell, jonh@sussex.ac.uk

hits since July 1999