[Technical Survey] The Post-AVA Era of Datasets and Outlook

2019-01-16 有三AI

This article was first published on the WeChat public account 《与有三学AI》.

Toward the ultimate goal of AI photography: what came after the AVA dataset?

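Earlier articles covered AVA, currently the largest aesthetics dataset, and the datasets that preceded it. AVA was released in 2012; more than five years have passed since then, and given how quickly machine learning iterates, new datasets were bound to appear.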

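This article gives a brief introduction to them, and it is also the last article in the dataset series. With this groundwork in place, it is time to get to the real work!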

1. AADB [1] (Aesthetics and Attributes Database)

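Overall, AADB can be seen as a supplement to the AVA dataset. Each image was rated by 5 annotators, and the final score is the mean of those 5 ratings; the dataset contains 10,000 images in total. Besides the scores, 11 attributes were also annotated.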
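The score aggregation described above can be sketched in a few lines. This is only a minimal illustration: the file names, the rating values, and the 1 to 5 scale are made up for the example and are not taken from the AADB release.

```python
from statistics import mean

RATERS_PER_IMAGE = 5  # per the description above: 5 annotators per image


def aggregate_score(ratings):
    """Return the mean of one image's ratings as its final score."""
    if len(ratings) != RATERS_PER_IMAGE:
        raise ValueError(f"expected {RATERS_PER_IMAGE} ratings, got {len(ratings)}")
    return float(mean(ratings))


# Hypothetical ratings (illustrative values on an assumed 1-5 scale).
ratings_by_image = {
    "img_001.jpg": [4, 5, 3, 4, 4],
    "img_002.jpg": [2, 1, 2, 3, 2],
}

# One final score per image.
scores = {name: aggregate_score(r) for name, r in ratings_by_image.items()}
```

Checking the number of ratings up front makes missing or duplicated annotations fail loudly instead of silently skewing the average.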

The main differences from the AVA dataset are:

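a) AVA contains many images that are not real photographs, as well as heavily post-processed ones, so the vast majority of AVA images score above 5 (out of 10). AADB, by contrast, pays more attention to balancing images from professional photographers and casual shooters, at a ratio of roughly 1:1.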

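b) Because there are few annotators, AADB specifically analyzes the consistency among them, which indirectly reflects annotator quality. The analysis shows high consistency, which suggests that the annotators are skilled and the labels are reliable;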

c) For image attributes, that is, style annotations, AADB supplements AVA.
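To make point b) concrete, here is a sketch of one common way to measure agreement between two annotators: the Spearman rank correlation of their ratings over the same images. The choice of metric is an assumption made for illustration (the text above does not say how AADB measured consistency), and the helper names are hypothetical.

```python
from statistics import mean


def average_ranks(values):
    """Rank values from 1..n; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # average of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the ranks."""
    if len(x) != len(y):
        raise ValueError("both annotators must rate the same images")
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical ratings from two annotators on the same six images.
annotator_a = [1, 2, 3, 4, 5, 3]
annotator_b = [2, 2, 3, 5, 5, 4]
agreement = spearman(annotator_a, annotator_b)
```

A value near 1 means the two annotators order the images almost identically, even if one of them is systematically harsher; that is why a rank-based measure is a natural fit for subjective scores.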

Like AVA, AADB also annotates attributes. So what is different?

First, let's recall AVA's 14 attributes; the number in parentheses is the count of images with that attribute: Complementary Colors (949), Duotones (1,301), High Dynamic Range (396), Image Grain (840), Light on White (1,199), Long Exposure (845), Macro (1,698), Motion Blur (609), Negative Image (959), Rule of Thirds (1,031), Shallow DOF (710), Silhouettes (1,389), Soft Focus (1,479), Vanishing Point (674).

Now let's look at AADB's 11 attributes.

1. “balancing element” – whether the image contains balanced elements;

2. “content” – whether the image has good/interesting content;

3. “color harmony” – whether the overall color of the image is harmonious;

4. “depth of field” – whether the image has shallow depth of field;

5. “lighting” – whether the image has good/interesting lighting;

6. “motion blur” – whether the image has motion blur;

7. “object emphasis” – whether the image emphasizes foreground objects;

8. “rule of thirds” – whether the photography follows rule of thirds;

9. “vivid color” – whether the photo has vivid color, not necessarily harmonious color;

10. “repetition” – whether the image has repetitive patterns;

11. “symmetry” – whether the photo has symmetric patterns.
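To compare the two attribute vocabularies side by side, the lists above can be put into plain Python structures. This is just a sketch: the variable names are hypothetical, and the matching is by exact (case-insensitive) name only.

```python
# AVA style attributes and their image counts, as listed above.
AVA_ATTRIBUTES = {
    "Complementary Colors": 949, "Duotones": 1301, "High Dynamic Range": 396,
    "Image Grain": 840, "Light on White": 1199, "Long Exposure": 845,
    "Macro": 1698, "Motion Blur": 609, "Negative Image": 959,
    "Rule of Thirds": 1031, "Shallow DOF": 710, "Silhouettes": 1389,
    "Soft Focus": 1479, "Vanishing Point": 674,
}

# AADB attributes, as listed above.
AADB_ATTRIBUTES = [
    "balancing element", "content", "color harmony", "depth of field",
    "lighting", "motion blur", "object emphasis", "rule of thirds",
    "vivid color", "repetition", "symmetry",
]


def normalize(name):
    """Lower-case and trim a name so the two vocabularies can be compared."""
    return name.strip().lower()


ava_names = {normalize(n) for n in AVA_ATTRIBUTES}
aadb_names = {normalize(n) for n in AADB_ATTRIBUTES}

shared = sorted(ava_names & aadb_names)     # named identically in both
only_aadb = sorted(aadb_names - ava_names)  # appear only in AADB's list
total_ava_labels = sum(AVA_ATTRIBUTES.values())  # sum of the listed counts
```

Only "motion blur" and "rule of thirds" match by name. "Shallow DOF" and "depth of field" are closely related but spelled differently, so a name-based comparison misses them. The remaining AADB entries ask whether an image has good content, lighting, color harmony, balance and so on, rather than naming a photographic technique, which is where AADB supplements AVA.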