Image Classification and Annotation、图像标注(image annotation)、图像检索(image retrieval)

Caltech101是属于image categorization数据库，见Linear Spatial Pyramid Matching using Sparse Coding for Image Classification(CVPR09-ScSPM)摘要写了： In a number of image categorization experiments, 其标题是Image Classification。Mingming Gong讲image/object Classification/categorization 四个是一样的概念.

我自己到谷歌上搜索：object categorization COIL,发现A novel color-context descriptor and its applications (ICME 2009)摘要最后一句： Experiments validate the discriminant power of the proposed descriptor in object categorization on COIL- 100 database and pedestrian identification in surveillance videos.故Deng Cai主页的COIL是属于object categorization的

63486.htm
Learning a Maximum Margin Subspace for Image Retrieval
Dong Xu's Phd thesis Section 7
Cooperative Sparse Representation Semi-supervised Image Annotation

---------------------------------------------------------------------------------------------------------------------------------------------------

图像分类是测试样本的预测label和实际的label比较得准确率。图像标注本质也是分类，是测试样本预测出来的tag和实际的tag做比较得准确率，看voc 07的训练tag，是5011*804的矩阵，也就是总共804个tag，如果该样本有这个tag，则该位置的tag是1否则是0.一个图像可以有多个tag，image annotation本质也是multi-label的问题。是不是所有样本的tag确实只有804个，其实有更多，将一些频率比较低的去掉了。Yong Luo讲在测试时有这样的情况，测试样本没有tag，这时没法算准确率，就将这个测试样本去掉。

---------------------------------------------------------------------------------------------------------------------------------------------------------
image annotation就是image classification. 见Cooperative Sparse Representation Semi-supervised Image Annotation. VI节A节第三段：with the annotations from a set of total 20 keywords.This is discussing with Weifeng Liu and Yong Luo. Sparse Unsupervised Dimensionality Reduction for Multiple View Data该文IV节E节 2)就用的Image Classification and Annotation.该文用三个数据集，MIML仅有类别没有tag，后两个仅有tag没有类别，论文说了分别81和100个tag，故MIML是测试样本的预测label和实际的label比较得准确率，后两个库是是测试样本的预测tag和实际的tag比较得准确率。Yong Luo讲没有人在voc上做image annotation，因为其tag没有太多的语义信息，比如pascal07_dictionary的804个tag中还有2003，voc的tag只能作为特征。

Yong Luo：另外还有三个做annotation的数据库Corel 5K，IAPR TC-12，ESP GAME. Annotation是给图像加标注，阿秋TIP fig 3. Tian Xia MSE实验第三节是Video annotation, Tian Xia写了，和image retrival做法一样。这三个数据库Yong Luo 建议不要按照ML-KNN的五个指标来进行比较。Image annotation就按照annotation的指标，TagProp Discriminative Metric Learning (ICCV 2009)Table 3的P、R和N+。这三个指标的定义该文讲了，该文参考文献17 A. Makadia, V. Pavlovic, and S. Kumar. A new baseline for image annotation. In ECCV, 2008也讲了。

整个领域基本都是分类或者回归问题，少部分是回归，比如是年龄回归(age regression)。mingming gong said that介于两者之间的还有一个是ordinal regression = ranking(值是离散的，还有大小)

发表于 2012-08-05 11:36 杰哥阅读(1466) 评论(0) 编辑收藏引用所属分类: 学术

常用链接

留言簿(58)

随笔分类

随笔档案

相册

Other

Paper submission

福彩

留学相关

论坛

搜索

学者

邮箱

中科大和中科院

搜索

最新评论

阅读排行榜

评论排行榜

Image Classification and Annotation、图像标注(image annotation)、图像检索(image retrieval)