Classic Content-Based Image Retrieval (CBIR) takes a single non-annotated query image, and retrieves similar images from an image repository. Such a search must rely upon a holistic (or global) view of the image. Yet often the desired content of an image is not holistic, but is localized. Specifically, we define Localized Content-Based Image Retrieval as a CBIR task where the user is only interested in a portion of the image, and the rest of the image is irrelevant. ACCIO! is a Localized CBIR System that uses labeled images in conjunction with a multiple-instance learning algorithm to first identify the desired object and re-weight the features, and then to rank images in the database using a similarity measure that is based upon individual regions within the image. We used the following two data sets:
The multiple-instance model was motivated by the drug activity prediction problem where each instance is a possible configuration (or shape) for a molecule of interest and each bag (example) contains all low-energy (and hence likely) configurations for the molecule.
Public data sets for learning in the multiple-instance model are for concept learning (i.e. boolean labels). Binding affinity between molecules and receptors is quantitative, borne out in quantities such as the energy released by the molecule-receptor pair upon binding and hence a real-valued label of binding strength is preferable.
The naming convention for the artificial data is LJ-r.f.s where: