Digital Image Processing and Applications, Zhejiang University


Understand the principles of image processing in depth, see its processes vividly demonstrated, and apply its methods flexibly.

- Course team

Course Overview

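Images and video are important visual media through which people record, express, and convey the external world, and they form the visual basis for perceiving it. Image processing is a fundamental enabling technology for applications in the Internet of Things, machine vision, and artificial intelligence.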

This course covers basic concepts, methods, and techniques of image processing in eight parts:

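1) Image representation (transforms are grouped by whether their basis signals are data-driven; covers the classic Fourier transform and discrete cosine transform, plus data-driven principal component analysis, and shows how the same image looks under different transforms)

2) Motion estimation (the principles of pixel-level optical flow and block-level block matching, and how their applications differ)

3) Image and video compression (the principles of the JPEG still-image compression standard and of MPEG video compression)

4) Image halftoning (the simplest thresholding method, the clustered-dot and dispersed-dot dither masks used in the printing industry, error diffusion, and others)

5) Image filtering (common noise types in images; traditional filters such as the median and Gaussian filters; and recent filtering methods aimed at texture)

6) Image interpolation and super-resolution (traditional interpolation methods, super-resolution based on image self-similarity, and others)

7) Image edge detection and segmentation (the Canny operator, mean-shift color segmentation, and others)

8) Video object tracking (the principles of the currently popular Discriminative Correlation Filter (DCF) tracking approach).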
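
As an illustration of the halftoning topic in item 4, the sketch below contrasts plain thresholding with error diffusion using the Floyd-Steinberg weights from reference [18]. It is a minimal pure-Python sketch, not the course's own material: the function names, the 16x16 flat test patch, and the threshold of 128 are all assumptions chosen for illustration.

```python
def threshold_halftone(img, t=128):
    """Binarize each pixel independently: 255 if value >= t, else 0."""
    return [[255 if v >= t else 0 for v in row] for row in img]


def floyd_steinberg(img, t=128):
    """Error diffusion with the Floyd-Steinberg weights 7/16, 3/16, 5/16, 1/16."""
    h, w = len(img), len(img[0])
    # Working copy that accumulates the diffused quantization error.
    buf = [[float(v) for v in row] for row in img]
    out = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            old = buf[y][x]
            new = 255 if old >= t else 0
            out[y][x] = new
            err = old - new
            # Push the error onto unprocessed neighbors (right and next row).
            if x + 1 < w:
                buf[y][x + 1] += err * 7 / 16
            if y + 1 < h:
                if x > 0:
                    buf[y + 1][x - 1] += err * 3 / 16
                buf[y + 1][x] += err * 5 / 16
                if x + 1 < w:
                    buf[y + 1][x + 1] += err * 1 / 16
    return out


def mean(img):
    """Average gray level of a 2D list image."""
    return sum(map(sum, img)) / (len(img) * len(img[0]))


# Hypothetical test image: a flat 16x16 patch of gray level 100.
flat = [[100] * 16 for _ in range(16)]
print(mean(threshold_halftone(flat)))  # 0.0: every pixel falls below the threshold
print(mean(floyd_steinberg(flat)))     # close to 100: dot density tracks the gray level
```

On a flat patch, thresholding collapses the whole region to black, while error diffusion keeps the average brightness close to the input because each pixel's quantization error is passed on to its unprocessed neighbors; only the error pushed past the image border is lost.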

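Throughout the course we combine theory with practice and assign extensive literature reading after class to deepen understanding of the basic concepts and theory of image processing. We use worked examples to analyze and compare the strengths and weaknesses of different image processing methods, and we pose questions that guide students toward independent, in-depth thinking.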
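
One way such a comparison might look, using the median and Gaussian filters named in the filtering topic: the sketch below applies both to a flat image with a single impulse ("salt") pixel. It is an illustrative sketch under stated assumptions rather than course material. The helper names, the 5x5 test image, the replicated-border handling, and the 3x3 binomial kernel standing in for a Gaussian are all assumptions.

```python
from statistics import median


def filter3x3(img, reduce_fn):
    """Apply reduce_fn to every 3x3 neighborhood (row-major order).

    Borders are handled by clamping indices, i.e. replicating edge pixels.
    """
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            window = [
                img[min(max(y + dy, 0), h - 1)][min(max(x + dx, 0), w - 1)]
                for dy in (-1, 0, 1)
                for dx in (-1, 0, 1)
            ]
            row.append(reduce_fn(window))
        out.append(row)
    return out


# 3x3 binomial kernel, a common small approximation to a Gaussian (sums to 16).
GAUSS_WEIGHTS = [1, 2, 1, 2, 4, 2, 1, 2, 1]


def gaussian3x3(window):
    """Weighted average of a row-major 3x3 window."""
    return sum(wt * v for wt, v in zip(GAUSS_WEIGHTS, window)) / 16


# Hypothetical test image: flat gray level 50 with one "salt" pixel in the center.
img = [[50] * 5 for _ in range(5)]
img[2][2] = 255

med = filter3x3(img, median)
gau = filter3x3(img, gaussian3x3)
print(med[2][2])  # 50: the outlier is discarded entirely
print(gau[2][2])  # 101.25: the outlier is attenuated but still visible
print(gau[2][1])  # 75.625: and it is smeared into neighboring pixels
```

The median filter removes an isolated impulse without disturbing its surroundings, whereas the linear Gaussian filter only spreads it out. The trade-off runs the other way for Gaussian-distributed noise, where weighted averaging is usually the better fit.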

Course Objectives

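The course aims to give learners a grasp of basic concepts, research approaches, and methods in image processing and computer vision, so that they can go on to deeper research in related fields and develop related application systems.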

Course Outline

Prerequisites

Linear algebra, signals and systems, advanced mathematics (calculus), and probability and statistics.

Certificate Requirements

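In response to national low-carbon and environmental protection policies, starting from the fall 2021 semester the China University MOOC platform no longer issues paper certificates and provides only electronic certificates. The application method and process are unchanged.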

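Electronic certificates can be verified by scanning the QR code on the certificate, or by visiting http://www.icourse163.org/verify and looking up the certificate number. Students can download and print their electronic certificates from the "Personal Center - Certificates - View Certificate" page.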

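Students who complete the course content and assessments and whose grade meets the course's assessment standard (each course has its own standard; see the grading criteria within the course) are eligible to apply for a certificate. They can complete the paid online application during the application period (the dates shown on the application page prevail).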

Notes on applying for a certificate:

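1. As required by relevant national laws and regulations, certificate applications require real-name verification. Please make sure the real-name information you submit is true, complete, and valid.

2. After real-name verification and payment are completed, the system automatically generates and sends the electronic certificate. Once the electronic certificate has been generated, refunds are not supported.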

References

[1] C. Tomasi and R. Manduchi, “Bilateral filtering for gray and color images,” in Proc. of IEEE International Conference on Computer Vision (ICCV), 1998.

[2] D. Glasner, S. Bagon, and M. Irani, “Super-resolution from a single image,” in Proc. of IEEE International Conference on Computer Vision (ICCV), pp.349-356, 2009.

[3] P. Felzenszwalb and D. Huttenlocher, “Efficient belief propagation for early vision,” International Journal of Computer Vision, vol.70, no.1, 2006.

[4] D. Comaniciu and P. Meer, “Mean shift: a robust approach toward feature space analysis,” IEEE Trans. On Pattern Analysis and Machine Intelligence, vol.24, no.5, pp. 603-619, 2002.

[5] X. Lu, “Color textile image segmentation based on multiscale probabilistic reasoning,” Optical Engineering, vol.46, no.8, 087002, 2007.

[6] A. V. Oppenheim, A. S. Willsky and I. T. Young, Signals and Systems, Prentice-Hall, 1983.

[7] M.J.T. Smith and A. Docef, A Study Guide for Digital Image Processing, Scientific Publishers, Inc., Riverdale, Georgia, 1999.

[8] G. Johansson, “Visual perception of biological motion and a model for its analysis,” Perception and Psychophysics, vol.14, pp.201-211, 1973.

[9] B. Lucas and T. Kanade, “An iterative image registration technique with an application to stereo vision,” in Proc. of International Joint Conf. On Artificial Intelligence, pp.674-679, 1981.

[10] B. Horn and B. Schunck, “Determining optical flow,” Artificial Intelligence, 17:185-203, 1981.

[11] T. Koga, K. Iinuma, A. Hirano, Y. Iijima, and T. Ishiguro, “Motion compensated interframe coding for video conferencing,” in Proc. of National Telecommunications Conference, New Orleans, LA, pp.G5.3.1–G5.3.5, Dec. 1981.

[12] R. Li, B. Zeng, and M. L. Liou, “A new three-step algorithm for block motion estimation,” IEEE Trans. On Circuits and Systems for Video Technology, 4(4): 438-442, 1994.

[13] L.-M. Po and W.-C. Ma, “A novel four-step search algorithm for fast block motion estimation,” IEEE Trans. On Circuits and Systems for Video Technology, 6(3): 313-317, 1996.

[14] S. Zhu and K.-K. Ma, “A new diamond search algorithm for fast block-matching motion estimation,” IEEE Trans. On Image Processing, 9(2): 287-290, 2000.

[15] G. K. Wallace, “The JPEG still picture compression standard,” Communications of the ACM, 34(4):30-44, April 1991.

[16] Z. Wang, A.C. Bovik, H.R. Sheikh, and E. P. Simoncelli, “Image quality assessment: from error visibility to structural similarity,” IEEE Trans. On Image Processing, 13(4):600-612, 2004.

[17] B. E. Bayer, “An optimum method for two-level rendition of continuous-tone pictures,” in Proc. of IEEE International Conference on Communications, pp. 11-26, 1973.

[18] R. Floyd and L. Steinberg, “An adaptive algorithm for spatial grey scale,” Society for Information Display Symposium, Digest of Technical Papers, pp.36-37, 1975.

[19] X.-Q. Lu and H. Sakaino, “A spatial adaptive filter for smoothing of non-Gaussian texture noise,” in Proc. of ICASSP, 2009.

[20] L. Rudin, S. Osher and E. Fatemi, “Nonlinear total variation based noise removal,” Physica D, 60:259-268, 1992.

[21] H. Cho, H. Lee, H. Kang, and S. Lee, “Bilateral texture filtering,” ACM Transactions on Graphics, 33(4):1-8, 2014.

[22] R. G. Keys, “Cubic convolution interpolation for digital image processing,” IEEE Trans. On Acoustics, Speech, and Signal Processing, ASSP-29(6): 1153-1160, 1981.

[23] X. Li and M. T. Orchard, “New edge-directed interpolation,” IEEE Trans. On Image Processing, 10(10): 1521-1527, 2001.

[24] J. Canny, “A Computational Approach to Edge Detection,” IEEE Trans. Pattern Analysis and Machine Intelligence, 8(6), pp. 679-698, 1986.

[25] J.W. Cook and E.J. Delp, “Multiresolution sequential edge linking,” in Proc. IEEE Intl. Conf. Image Processing, pp.41-44, 1995.

[26] D. Comaniciu and P. Meer, “Robust analysis of feature spaces: color image segmentation,” in Proc. of IEEE CVPR, 1997.

[27] D.S. Bolme, J.R. Beveridge, B.A. Draper, and Y.M. Lui, “Visual object tracking using adaptive correlation filters,” in Proc. of CVPR, 2010.

[28] J.F. Henriques, R. Caseiro, P. Martins, and J. Batista, “High-speed tracking with kernelized correlation filters,” IEEE Trans. On Pattern Analysis and Machine Intelligence, vol.37, no.3, pp.583-596, Mar. 2015.
