Code

Notice: These resources may require a password to open. Please supply your name, institute, and position to get the password, using the Google form or Wenjuan. The source code is free for research and education use only. Please cite our paper if you use any part of our source code or data in your research.

Source code

We open-source the code for all our papers by default, except for a few supported by commercial companies under an NDA (non-disclosure agreement). We have therefore stopped updating the following list. Please go to the publication page for the source code.

C++: Salient Object Detection: A Discriminative Regional Feature Integration Approach, IJCV, 2017. [project] [bib]
C++: HFS: Hierarchical Feature Selection for Efficient Image Segmentation. ECCV, 2016. [project][bib]
C++ & Matlab: Salient Object Detection: A Benchmark, IEEE TIP, 2015. [project]
C++ & CUDA: Struck: Structured output tracking with kernels, IEEE TPAMI, 2016. [pdf]
C++ & CUDA: SemanticPaint: Interactive 3D Labeling and Learning at your Fingertips, ACM TOG 2015. [project]
C++: DenseCut: Densely Connected CRFs for Realtime GrabCut, CGF 2015. [project] (I have not had enough time to reorganize the code. This is an initial version without careful comments or organization.)
C++ & CUDA: A Framework for the Volumetric Integration of Depth Images, arxiv eprint, 2014. [project]
C++: “BING: Binarized Normed Gradients for Objectness Estimation at 300fps”, IEEE CVPR 2014. [project] [bib] [readme]
C++: “Efficient Salient Region Detection with Soft Image Abstraction”, IEEE ICCV 2013. [project] [bib]
C++ (speech recognition part): “ImageSpirit: Verbal Guided Image Parsing”, ACM TOG, 2014. [bib] [project]. Full code from Github.
C++: “Global Contrast based Salient Region Detection”, IEEE TPAMI 2015. [project] [FAQs] [bib]
C++: “Curve Structure Extraction for Cartoon Images” [Pdf] [bib]
Executable and C++: “A Shape-Preserving Approach to Image Resizing”, CGF 2009. [project] [bib]
Commercial version (神笔小Q, powered by QQ实验室 (QQ Labs)) for our paper “Sketch2Photo: Internet Image Montage”, ACM TOG, 2009. [bib]
Matlab: “Connectedness of Random Walk Segmentation”, IEEE TPAMI 2009. [bib]

License

The code is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International Public License, for non-commercial use only. Any commercial use requires formal permission first.

Data

COD10K (Baidu w3up | Google) (2.25G): Camouflaged Object Detection, CVPR, 2020.
SIP1K (Baidu pan fetch code: 46w8 | Google drive): Rethinking RGB-D Salient Object Detection: Models, Data Sets, and Large-Scale Benchmarks, TNNLS, 2020. (Human-centered RGB-D salient object detection)
DAVSOD (Baidu [fetch code: ivzo] | Google): Shifting More Attention to Video Salient Object Detection, CVPR, 2019. (Video Salient Object Detection)
SOC6K (TestSet GT) [Object-level]|[Instance-level]
SOC6K ([730.2MB] Baidu | Google): Salient Objects in Clutter: Bringing Salient Object Detection to the Foreground, ECCV, 2018. (The first large-scale instance-level salient object detection dataset with rich, high-quality annotations, e.g., object-level, instance-level, and image-level annotations.) [project page] [bib].
aNYU: attribute-augmented NYU dataset published with our CVPR 2014 paper “Dense Semantic Image Segmentation with Objects and Attributes” (please follow the instructions on the project page to get the unzip password).
THUR15K (787MB): Salient object region labeling (annotated at pixel accuracy when such a salient object exists) for 15000 non-selected internet images, divided into five groups according to the keywords used to download them. Please cite our paper [BIB] if you use it. We also provide a Matlab file of Fig. 7(b) for easier comparison.
MSRA10K: Pixel-accurate salient object labeling for 10000 images from the MSRA dataset.
PASCAL VOC annotations: Unfortunately, the XML format used by one of the most widely used benchmark datasets (PASCAL VOC) is not compatible with OpenCV. Here is a YML version of the PASCAL VOC annotations (VOC 2007), which can be read by OpenCV directly.

Patents

International patents: (Espacenet)

Method and Apparatus for Detecting Salient Object in Image, Qibin Hou, Mingming Cheng, Wei Bai, Xunyi Zhou, US20200143194
Image processing method and image processing device, Shi-Min Hu, Ming-Ming Cheng, Guo-Xin Zhang, Niloy J. Mitra, Xiang Ruan, US20120288189, EP2523165, CN102779338A, JP2012243313A
Method for calculating image visual saliency based on color histogram and overall contrast, Shi-Min Hu, Ming-Ming Cheng, Guo-Xin Zhang, WO 2012122682, PCT/CN2011/000690
Method for detecting similar units based on outline belt graph, Shi-Min Hu, Ming-Ming Cheng, Fang-Lue Zhang, WO 2011131029, CN 101833668

Chinese patents (企知道, China patent announcements):

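程明明, 姜鹏涛, 张长彬, 侯淇彬, 曹洋, Method for mining target object regions based on online attention accumulation, application no.: 201910715341.X, filing date: 2019-08-05.
杨巨峰, 夏爽, 冀晓东, 陈松, 程明明, A segmentation system for multiple myeloma lesions based on multi-sequence MRI, application no.: 201910582883.4, filing date: 2019-07-01.
杨巨峰, 梁杰, 程明明, A multi-modal image enhancement method based on orthogonal meta-space, application no.: 201910571058.4, filing date: 2019-06-28.
范登平, 程明明, 张钊, A salient object detection method based on a depth map filter, application no.: 201910524475.3, filing date: 2019-06-18.
程明明, 杨巨峰, 伍小平, 展翅, A plant disease and pest recognition method based on an attention mechanism and multi-level convolutional features, application no.: 201910404278.8, filing date: 2019-05-16.
杨巨峰, 程明明, 孙晓晓, 陈丽怡, An image classification method based on adversarial neural networks and massive noisy data, application no.: 201910358002.0, filing date: 2019-04-30.
程明明, 范登平, 林铮, 吴文海, A video salient object detection method based on an attention shift mechanism, application no.: 201910347420.X, filing date: 2019-04-28.
程明明, 刘云, 吴宇寰, Weakly supervised instance segmentation method based on multiway cut, application no.: 201910347532.5, filing date: 2019-04-28.
杨巨峰, 程明明, 陈丽怡, 折栋宇, 孙晓晓, A painting style classification method based on contextual information, application no.: 201910311265.6, filing date: 2019-04-18.
程明明, 高尚华, 赵凯, Multi-scale image information extraction method that can be integrated into neural network architectures, and its applications, application no.: 201910242489.6, filing date: 2019-03-28.
程明明, 赵凯, A face recognition method based on exclusive regularization, application no.: 201910235306.8, filing date: 2019-03-27.
程明明, 陈林卓, 李炫毅, Point cloud feature extraction method based on a spatial attention mechanism, application no.: 201910235177.2, filing date: 2019-03-27.
程明明, 刘云, Salient object detection method based on multi-level contextual information fusion, application no.: 201811547592.3, filing date: 2018-12-18.
杨巨峰, 程明明, 折栋宇, 王恺, An image emotion classification and retrieval algorithm based on deep metric learning, application no.: 201810173303.1, filing date: 2018-03-02.
刘姜江, 程明明, 侯淇彬, 范登平, 谭永强, A general detection method for multiple task types based on deep networks, application no.: 201810173285.7, filing date: 2018-03-02.
杨巨峰, 程明明, 梁杰, 王恺, Triplet-based subspace clustering method with automatic estimation of the number of clusters, application no.: 201810170826.0, filing date: 2018-03-01.
杨巨峰, 程明明, 孙延, 梁杰, 王恺, An image impressiveness prediction method based on multiple views and multiple features, application no.: 201810171247.8, filing date: 2018-03-01.
范登平, 程明明, 曹洋, 吴宇寰, 任博, A method for evaluating the similarity of binary foreground maps, application no.: 201810171102.8, filing date: 2018-03-01.
杨巨峰, 程明明, 孙晓晓, 王恺, A method for training convolutional neural networks on web images based on iterative sampling and one-to-many label correction, application no.: 201810171017.1, filing date: 2018-03-01.
侯淇彬, 程明明, 白蔚, 周迅溢, Method and apparatus for detecting salient objects in images, application no.: 201710488970.4, filing date: 2017-06-26.
程明明, 刘笑畅, 白蔚, 苗磊, An image processing method and device, application no.: 201710602208.4, filing date: 2017-07-24.
程明明, 刘云, 侯淇彬, 白蔚, Image segmentation method and apparatus, application no.: 201610850223.6, filing date: 2016-09-28.
胡事民, 张方略, 程明明, Image composition method based on blending of grouped objects, application no.: 201110262737.7, filing date: 2011-09-06.
胡事民, 程明明, 张国鑫, Method for calculating image visual saliency based on color histogram and global contrast, application no.: 201110062520.1, grant date: 2012-07-25.
胡事民, 程明明, 张方略, A method for detecting similar units based on outline belt graph, application no.: 201010159931.8, grant date: 2011-12-28.
胡事民, 程明明, 张方略, Method for automatically completing hidden parts of similar primitives based on geometric information, application no.: 201010158440.1, grant date: 2012-09-05.
胡事民, 程明明, 张国鑫, A content-aware image resizing method based on conformal energy, application no.: 200910092756.2, grant date: 2011-12-28.
胡事民, 陈韬, 程明明, 张松海, Method for automatic quality assessment of image composition based on an image library, application no.: 200910086937.4, grant date: 2011-09-14.
胡事民, 陈韬, 程明明, 张松海, Image composition method and apparatus based on mixed gradient fields and mixed boundary conditions, application no.: 200910084769.5, grant date: 2011-05-11.
胡事民, 程明明, 陈韬, 张松海, A sketch-based method and system for automatic extraction of internet image primitives, application no.: 200910081069, grant date: 2010-09-01.
胡事民, 张一飞, 程明明, Method for computing video pixel scalability, application no.: 200810114466.9, grant date: 2009-12-09.
胡事民, 程明明, 陈韬, 张松海, High-quality line structure extraction method for cartoons, application no.: 200810106664, grant date: 2009-09-25.
程明明, 张少甫, 李文博, 周瑞, 仇玲, An energy-saving system and control method for on-demand dynamic adjustment of street lamp brightness, application no.: 200810018101, grant date: 2012-04-25.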

433 Comments

Guanbin Li

10 years ago

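Hello, have you modified the RC part of the code? The results I get now seem better than the RC results that many papers report in their comparisons.

Hello, I ran your code. Why are the GC results not as good as the RC results? Is that normal?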

陈小曼

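Hello Prof. Cheng,

I downloaded the code for your paper Global Contrast based Salient Region Detection (CmCode-master.zip). With VS2010 it compiles without problems, but when I run it I get the following error (in both Debug and Release):
Unable to start program "..\CmCode-master\CmLib\Release\CmLib.lib". The specified file is an unrecognized or unsupported binary format.
How can I solve this? I would sincerely appreciate your help. Thank you!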

吴仲华

Reply to 陈小曼

Have you solved this problem? I ran into it too. Please let me know how you fixed it, thanks.

尹莹莹

Reply to 吴仲华

9 years ago

To the two posters above: has this problem been solved? I ran into it too... Any pointers would be appreciated...

黄凌

Thanks

高航

杨尚明

Professor, the link for getting the unzip password is broken now. Could you provide the password?

MM Cheng

Author

Reply to 杨尚明

I just tried it and it still works fine.

GSH

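Hello Prof. Cheng, I downloaded the code for your paper Global Contrast based Salient Region Detection. Is the xticklabel_rotate function used in the generated Result.m a separate function that you wrote in Matlab? Also, I could not find the F-measure plot from Fig. 12(c), Saliency segmentation methods, in CmEvaluation.cpp. I have just started learning about saliency and hope you can reply when you see this. Thank you very much!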

Reply to GSH

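Hello, you can find xticklabel_rotate.m with a quick Google search. It is only used to adjust the formatting, so it does not matter much. The F-measure data are already included in results.m. If you do not see the plot, you can simply plot the data yourself. Best wishes.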

Reply to MM Cheng

OK, thank you Prof. Cheng!
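
For readers who want to recompute or plot the F-measure discussed in this thread, it is commonly defined as a weighted harmonic mean of precision and recall. The following is only a minimal C++ sketch under stated assumptions: the function name fMeasure is hypothetical, and the default weight beta2 = 0.3 is a value often seen in the saliency literature, not something read from results.m or CmEvaluation.cpp.

```cpp
// Weighted harmonic mean of precision and recall:
//   F = (1 + beta2) * P * R / (beta2 * P + R)
// beta2 is the squared weight. The default 0.3 is an assumption for
// illustration, not a value taken from results.m or CmEvaluation.cpp.
double fMeasure(double precision, double recall, double beta2 = 0.3) {
    const double denom = beta2 * precision + recall;
    if (denom <= 0.0) {
        return 0.0;  // precision and recall are both zero: define F as 0
    }
    return (1.0 + beta2) * precision * recall / denom;
}
```

With beta2 below 1, precision is weighted more heavily than recall, so a call such as fMeasure(0.9, 0.8) lands closer to the precision value than an unweighted harmonic mean would.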

mengying

Very good

Jia-Zhou

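Hello Prof. Cheng, I downloaded the cmcode code from GitHub, but I get an error when building it. The error points to these three lines of code: FILE *f; fopen_s(&f, _S(resFile), "a"); CV_Assert(f != NULL); and the message is "Unhandled exception at 0x0F40CAB4 (msvcr100d.dll) in Saliency.exe: 0xC0000005: Access violation reading location 0x4C554E20." I do not know the cause. Could you help? Thanks.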

weiguo

Reply to Jia-Zhou

Hello, I found cmcode.sln in the downloaded folder, but it will not open. What could be the reason?

Reply to weiguo

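Your Visual Studio version may be too old.

This problem is usually caused by an incorrect OpenCV installation. Please download the latest 64-bit version of OpenCV and configure it correctly.

Professor, does the provided code only run on 64-bit machines? My machine is 32-bit.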

panpan

Has your problem been solved? How did you solve it? I get a similar error: OpenCV Error: Assertion failed (f != NULL) in unknown function. How can I fix it?
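
The assertion quoted in this thread, CV_Assert(f != NULL), fails whenever the results file cannot be opened. The thread does not confirm the cause in these particular cases (the author's reply points to the OpenCV installation), but a missing output directory or an unwritable path would produce the same failure. A hedged sketch of a more informative check follows. It uses the portable std::fopen rather than fopen_s, and the helper name appendLine is hypothetical, not part of CmCode.

```cpp
#include <cerrno>
#include <cstdio>
#include <cstring>
#include <string>

// Hypothetical helper (not part of CmCode): append one line to a results
// file and report why the open failed instead of asserting.
bool appendLine(const std::string& path, const std::string& line) {
    std::FILE* f = std::fopen(path.c_str(), "a");
    if (f == nullptr) {
        // Typical reasons: the directory does not exist or is not writable.
        std::fprintf(stderr, "Cannot open '%s' for appending: %s\n",
                     path.c_str(), std::strerror(errno));
        return false;
    }
    std::fprintf(f, "%s\n", line.c_str());
    std::fclose(f);
    return true;
}
```

Printing the path together with the system error message usually makes it clear whether the problem is the output location or something else in the setup.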

王震

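Hello Prof. Cheng:
I am running your InfiniTAM code on Win8.1 + freeglut2.81 + NVIDIA CUDA6.5 + OpenNI 2.2.0.33. When I test with the PPM and PGM files from the example, only the leftmost Reconstruction window updates in real time. The Live depth image and Live colour image windows on the right only show the still frames loaded at the start and do not update in sync with the 3D reconstruction window on the left. What could be the problem?
Thanks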

张振武

Hello Prof. Cheng,

Can your code be downloaded? There are no download items on the download page.

Reply to 张振武

Of course. Please follow the instructions.

Xiang Yu

Prof. Cheng:

Hello! When running your program I found that apFun = (apcluster32)GetProcAddress(dlh, "apcluster32"); always returns NULL. Do I need to download an AP clustering DLL myself?

Reply to Xiang Yu

After I downloaded the DLL as instructed in your program, it works fine.

liyong

Hello Prof. Cheng,
Is the source code for "Global Contrast based salient region detection" the same as for "Salient Object Detection and Segmentation"? Are both in SaliencyICCV2013.zip?

Reply to liyong

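Global Contrast based salient region detection is the CVPR 2011 and PAMI 2014 one. SaliencyICCV2013.zip is the ICCV 2013 one. Follow the links on this page to find the corresponding code.

Thank you, Prof. Cheng. How can I get the passwords for SalientObjectCVPR2011.zip and SaliencyICCV2013.zip?

I found the password. Thank you, Prof. Cheng!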

wangcheng

Hello, did you get this code running? Could you send me a copy?

Marc Wang

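Dear Prof. Cheng,
My environment: Win7_64-bit + Visual Studio 2013 + OpenCV2.4.9;
Configuration: after downloading CmCode from GitHub, I disabled the build option in Configuration Manager for every project except CmLib and Saliency, set Saliency as the single startup project, and built both CmLib and Saliency as Release_Win32;
Error message: error LNK1104: cannot open file "CmLib.lib";

Do I need to add the output path of "CmLib.lib" to the Additional Library Directories of the Saliency project?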

Reply to Marc Wang

Yes. Just set the path correctly and it will work.

CCL

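Hello, could you please find time to send me a copy of the CmCode code? I have never used a Git server, my computer is too slow, and I would rather not install anything else. My email is 1669103125@qq.com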

Kerwin

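Hello Prof. Cheng, when I build the cmcode project I get this error: Cluster\CmColorQua.cpp(323): error C2678: binary "/=": no operator found which takes a left-hand operand of type "cv::Vec3f" (or there is no acceptable conversion). Is this because my OpenCV version is wrong? I am using 2.3.1.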

Reply to Kerwin

Try a newer version of OpenCV, for example the latest one. That should fix it.

didijiang

Prof. Cheng, in the paper "Dense Semantic Image Segmentation with Objects and Attributes", why do attributes need to be handled at different levels? Would the pixel level alone not be enough?

Reply to didijiang

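This is a common design in CRFs. The region level can capture attributes that are hard to extract effectively at the pixel level. In addition, a region usually shares a single label, so grouping its pixels together prevents small noisy regions from affecting the result and yields better boundaries.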

Alex

Hello Heverton Sarah!
Did you succeed in testing the code? I tried to test it, but I failed. You are welcome to contact me by email to discuss the problem.
My email: shdxky2012@163.com
Shanghai University

QiuChunchun

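Hello Prof. Cheng! I tested your source code on my own images and ended up with nearly 10,000 proposals. This should not depend on the image size, should it? Or did I miss something that needs to be changed?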

Reply to QiuChunchun

The bias term is different for each scale. Take a careful look at stage II.

Heverton Sarah

Hi,

I would like to test the code from the paper “Global Contrast based Salient Region Detection”. Which project should I compile? Is it the Saliency project?

Thanks,

Heverton Sarah
Fluminense Federal University

Reply to Heverton Sarah

Reply to Alex

Do you have a compilation error, or is it something else?

Yes. You should compile the saliency project.

孔祥衡

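I just read your BING code. When deciding whether a bounding box has been detected, it checks whether the overlap area divided by the area covered by the two boxes is > 0.5. Is this criterion reasonable when the labeled box lies inside the result box and occupies only half of it?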

Reply to 孔祥衡

This is a PASCAL VOC evaluation criterion. The vast majority of methods of this kind use it.
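
The criterion discussed in this exchange, intersection area divided by the area covered by both boxes with a threshold of 0.5, can be sketched in a few lines of C++. This is an illustrative sketch and not the BING source: the Box struct, the function names, and the use of continuous corner coordinates are all assumptions made for the example.

```cpp
#include <algorithm>

// Axis-aligned box with corners (x1, y1) and (x2, y2), x1 <= x2, y1 <= y2.
// Continuous coordinates are assumed here for simplicity.
struct Box {
    double x1, y1, x2, y2;
};

// Area of a box; degenerate or inverted boxes count as empty.
double area(const Box& b) {
    return std::max(0.0, b.x2 - b.x1) * std::max(0.0, b.y2 - b.y1);
}

// Intersection area divided by the area of the union of the two boxes.
double intersectionOverUnion(const Box& a, const Box& b) {
    const Box inter{std::max(a.x1, b.x1), std::max(a.y1, b.y1),
                    std::min(a.x2, b.x2), std::min(a.y2, b.y2)};
    const double i = area(inter);
    const double u = area(a) + area(b) - i;
    return u > 0.0 ? i / u : 0.0;
}

// A ground-truth box counts as detected when the overlap exceeds the threshold.
bool isDetected(const Box& groundTruth, const Box& proposal,
                double threshold = 0.5) {
    return intersectionOverUnion(groundTruth, proposal) > threshold;
}
```

In the case raised in the question, a labeled box that lies inside the proposal and covers exactly half of it gives a ratio of exactly 0.5, so under a strict "greater than 0.5" test it would not count as detected.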
