In my work, I claim that there is indeed a need for various kinds of visual queries, and that systems for image retrieval should support these in some manner. Several different surveys of CBIR systems have been made, covering both different systems and different aspects of these systems. However, I was not able to find any surveys with a special focus on techniques for querying these systems. The 2000 survey made by Veltkamp and Tanase includes a section on query specification techniques, but this has not been summarized to any degree. Consequently, I decided to do a survey of the techniques available in these systems. I’ve surveyed 58 systems from the past 13 years.
This has not been a very large part of my work, and I’m not claiming that this is a comprehensive study, that I have included all systems or that this is a very in-depth analysis of query techniques, but I think I’ve been able to at least present a summary of what techniques have been used in CBIR systems. I’ve identified 6 main categories of query specification techniques. These are by no means new categories and I think all of them have been described previously. I’ve only created my own descriptions of them:
- Query by Internal Example (QBIE). These systems allow queries based on images already present in the image database. Queries are made by selecting one or more images from the collection and using these as a basis for a similarity search. This type of query can either represent an initial query to the system, or be initiated based on the results obtained from another query. A number of the systems using this approach present the user with small set of sample images, representing the topics or image types present in the collection. One recent example of the QBIE approach is the FIRE system.
- Query by External Example (QBEE). These systems allow queries based on images external to the image database. Queries are made by submitting one or more images to the system and using these as a basis for similarity search. One recent example of this approach is the Retrievr system.
- Query by Spatial Composition (QBSC). These systems allow the user to compose a query image representing their information needs using one or more drawing tools. This query type is also sometimes known as Query by Sketch. A recent example of this approach is the Retrievr system.
- Query by Features (QBF). These systems allow queries based on specification of low level features. Queries are made by defining and manipulating colour histograms, creating or selecting texture samples or other feature specification techniques. One example of this approach is the MARS system.
- Query by Image Area (QBA). These systems allow queries based on selection of an image segment. Queries are expressed by selecting a section of an image (Internal or external) and using this selection as a basis for a similarity search. One example of this approach is the BLOBWORLD system.
- Query by Text (QBT). These systems allow text-based queries. Queries are either expressed as keywords or through selection of collection categories. These queries are normally used as a method of initiating a search, providing the user with an initial set of images which can be used in a QBE search. One recent example of this approach is the CORTINA system.
Most of the surveyed systems support one (21) or two (25) query of these techniques. Only 8 systems support 3 specification techniques.
| Type |
Number |
Percentage |
| QBIE |
36 |
62,07 % |
| QBT |
17 |
29,31 % |
| QBEE |
16 |
27,59 % |
| QBSC |
11 |
18,97 % |
| QBF |
11 |
18,97 % |
| QBA |
7 |
12,07 % |
The most used technique is Query by Internal Example, both alone and in combination with other techniques. In a majority of the systems combining QBIE with other techniques, these other techniques are often used as a point-of-entry to the system. The initial query to the system is often expressed the other techniques, while QBIE is used to either refine the query using a relevance-feedback loop, or to initiate a new query based on one or more of the retrieved images. For example, in the AMORE system the user may select a category of images through a textual label (e.g. “arts” or “travel”), or by choosing a random set of images.
Only 11 of the systems support the type of queries I’m concerned, queries by spatial composition. For these systems, I’ve tried to identify and classify the tools the user has available when composing the queries. 5 different categories of tools for composing queries using QBSC have been identified:
- Freehand drawing (F). These systems allow visual query specification through the use of freehand drawing. This refers to the use of a mouse (or similar tactile input devices) to create an interface similar to drawing using pen and paper. This allows the user a high degree of freedom to express any types of content, only limited to the user’s level of skill in freehand drawing.
- Colour specification (C). These systems allow visual query specification through the use of colours in combination with other tools. This allows the user to specify which colours should be present in the query image, as well as the spatial distribution of the colour.
- Geometric Primitives (GP). These systems allow visual query specification through building a query image using geometric primitives such as circles, squares and lines. This allows the user to build the spatial composition of the query using such shapes.
- Shape Prototypes (SP). These systems allow visual query specification through the use of example shapes or shape prototypes, representing real-world objects. This allows the user to use these shape prototypes to spatially arrange the query participants within the query image.
- Texture (T). These systems allow visual query specification through the use of texture samples or texture specification tools. This allows the user to express which textures should be present in an image, as well as specify the spatial arrangement of these textures.
| System |
Tool types |
||||
| |
F |
C |
GP |
SP |
T |
| WISE |
x |
x |
x |
||
| Picasso |
x |
x |
x |
||
| CHROMA |
x |
x |
|||
| DrawSearch |
x |
x |
|||
| Retrievr |
x |
x |
|||
| VP Image Retrieval system |
x |
x |
|||
| ImageScape |
x |
x |
|||
| QBIC |
x |
x |
x |
||
| Herminate Museum |
x |
x |
|||
| NETRA |
x |
x |
|||
| VisualSeek |
|
x |
|
|
x |
| Total (11 systems) |
7 |
9 |
4 |
1 |
4 |
I’ve used these findings in a discussion on the expressive convenience of visual queries and image retrieval. I’ll post more in this in the near future.



















