数字图像处理与应用_浙江大学

播放

数字图像处理与应用

第1次开课

开课时间： 2019年10月14日 ~ 2019年12月31日

学时安排： 3-5小时每周

当前开课已结束已有 16107 人参加

立即自学

往期不提供结课证书，想参加下学期课程，点击这里预约>>

课程详情

课程评价(39)

spContent=深刻理解图像处理原理，生动展现图像处理过程，灵活应用图像处理方法

深刻理解图像处理原理，生动展现图像处理过程，灵活应用图像处理方法

—— 课程团队

课程概述

图像和视频是人类记录、表达和传递外部世界的重要视觉载体，也是感知外部世界的视觉基础，图像处理是实现物联网、机器视觉和人工智能等相关应用的基本支撑技术。

本课程将从如下八个方面来讲授图像处理的一些基本概念，方法与技术：

1）图像表征（依据图像基信号是否基于数据驱动，介绍图像处理中经典的傅里叶变换，离散余弦变换和基于数据驱动的主元分析法，生动展现同一幅图像在不同变换下的形式）

2）运动估计（分别介绍像素级别的光流法和图像块级别的块匹配算法原理，以及它们应用差异之所在）

3）图像与视频压缩技术（分别介绍包括静态数字图像压缩标准JPEG和视频压缩MPEG原理）

4）图像半色调技术（介绍包括最简单的阈值方法，用于印刷业的聚合型抖动模板和分散型抖动模板，误差传播法等）

5）图像滤波技术（介绍图像中常见的噪声类型，传统图像滤波，如中值滤波和高斯滤波等，以及最近出现针对纹理的滤波方法）

6）图像插值与超分辨率技术（介绍包括传统图像插值方法，和基于图像自相似图像超分辨率技术等）

7）图像边缘检测与分割技术（介绍包括Canny算子，mean-shift图像色彩分割方法等）

8）视频目标跟踪技术（介绍目前较热门的Discriminative Correlation Filter(DCF)的目标跟踪技术原理）。

在授课过程中我们通过理论与实践相结合方式，以及课后大量文献阅读来加深对图像处理基本概念和理论的理解；通过实例来分析比较不同图像处理方法的优缺点；通过提出问题来引导学生独立深入思考。

授课目标

课程的目标是通过学习，能让学员掌握图像处理与计算机视觉中一些基本概念，基本研究思路和方法等，从而帮助他们展开相关领域后续深入的研究工作，和开发相关应用系统等。

课程大纲

数字图像处理与应用课程简介(Introduction)

课时目标：我们将在这节中介绍课程大概内容，以及为什么我们需要做数字图像处理，展示图像处理一些经典应用。

1.1 Introduction

自然图像的表征方法(Image Representation)

课时目标：在这节中，我们介绍一些常见的图像表征方法，如最直观的空间域图像表征，基于先验图像基的傅里叶变换，DCT变换和基于数据驱动的PCA方法等。

2.1 Spatial Domain Representations of Natural Images

2.2 Fourier Transform

2.2.1 Basic Exponential Signals and Fourier Transform

2.2.2 Continuous-time Fourier Series and Discrete-time Fourier Series

2.2.3 DFT (Discrete Fourier Transform)

2.2.4 2D-DFT

2.3 DCT (Discrete Cosine Transform)

2.4 PCA (Principal Component Analysis)

运动估计(Motion Estimation)

课时目标：在这节中，我们将分别介绍像素级别的光流法和图像块级别的块匹配算法的原理，比较不同方法的性能。

3.1 Introduction of Optical Flow

3.2 Introduction of LK algorithm

3.3 Introduction of HS algorithm

3.4 The Principle of Block Matching Algorithms

3.5 Three Step Search Algorithm and New Three Step Search Algorithm

3.6 Four Step Search Algorithm and Diamond Search Algorithm

图像与视频压缩技术(Image & Video Compression)

课时目标：这一节中，我们介绍静态图像压缩标准 --- JPEG 和视频压缩 --- MEPG框架，让学生能掌握图像和视频压缩的原理，以及图像和视频压缩流程。

4.1 Introduction of JPEG

4.2 Introduction of MPEG

人眼视觉系统(Introduction of Human Vision System)

课时目标：由于图像和视频都是通过人眼观看的，所以介绍人眼视觉系统一些重要特征是非常必要的，我们将这一节中通过介绍“Mach Bands”来解释为什么标准灰度图像每个像素点上灰度值要用8比特来表示。

5.1 Introduction of Human Vision System (HVS)

5.2 Different Color Spaces and Evaluation Metrics

图像滤波技术(Image Filtering)

课时目标：在这一节中我们将首先介绍常见的图像噪声，如椒盐噪声，高斯白噪声和纹理噪声等，然后针对不同噪声类型，分别介绍不同的图像滤波方法，包括图像中值滤波，双边滤波和双边纹理滤波等。

6.1 Image Noise and Median Filtering

6.2 Bilateral Filtering (ICCV'1998)

6.3 Nonlinear Total Variation Filtering

6.4 Bilateral Textural Filtering (SIGGRAPH'2014)

图像插值与超分辨率技术(Image Interpolation and Superresolution)

课时目标：在这一节中，我们不但介绍传统图像插值方法，如双线性插值，还介绍边缘导向的图像插值方法（发表在IEE Trans. on Image Processing）2001上论文，以及发表于ICCV'2009上的基于图像自相似的图像超分辨率方法。

7.1 Nearest Neighbor and Bilinear Interpolation

7.2 Bicubic Convolution Interpolation

7.3 Edge-directed Interpolation (TIP'2001)

7.4 Image Superresolution from a Single Image (ICCV'2009)

边缘检测与图像分割方法(Edge Detection and Segmentation)

课时目标：在这一节中我们介绍经典的边缘检测算子---Canny 算子和它的一些变异算子，以及基于mean-shift图像分割算法。

8.1 Introduction of Canny Operator

8.2 Introduction of Mean-shift Algorithm

视频目标跟踪技术(Tracking)

课时目标：在这节中，我们介绍目前比较热门的视觉跟踪技术 --- Discriminative Correlation Filter和其中一种著名的算法KCF --- Kernelized Correlation Filtering。我们分析为什么DCF可以应用实时跟踪系统，以及它的局限性等。

9.1 The Principle of DCF and KCF

展开全部

预备知识

线性代数，信号与系统，高等数学，统计概率论等。

证书要求

为积极响应国家低碳环保政策， 2021年秋季学期开始，中国大学MOOC平台将取消纸质版的认证证书，仅提供电子版的认证证书服务，证书申请方式和流程不变。

电子版认证证书支持查询验证，可通过扫描证书上的二维码进行有效性查询，或者访问 https://www.icourse163.org/verify，通过证书编号进行查询。学生可在“个人中心-证书-查看证书”页面自行下载、打印电子版认证证书。

完成课程教学内容学习和考核，成绩达到课程考核标准的学生（每门课程的考核标准不同，详见课程内的评分标准），具备申请认证证书资格，可在证书申请开放期间（以申请页面显示的时间为准），完成在线付费申请。

认证证书申请注意事项：

1. 根据国家相关法律法规要求，认证证书申请时要求进行实名认证，请保证所提交的实名认证信息真实完整有效。

2. 完成实名认证并支付后，系统将自动生成并发送电子版认证证书。电子版认证证书生成后不支持退费。

参考资料

[1] C. Tomasi and R. Manduchi, “Bilateral filtering for gray and color images,” in Proc. of IEEE International Conference on Computer Vision (ICCV), 1998.

[2] D. Glasner, S. Bagon, and M. Irani, “Super-resolution from a single image,” in Proc. of IEEE International Conference on Computer Vision (ICCV), pp.349-356, 2009.

[3] P. Felzenszwalb and P. Huttenlocher, “Efficient belief propagation for early vision,” International Journal of Computer Vision, vol.70, no.1, 2006.

[4] D. Comaniciu and P. Meer, “Mean shift: a robust approach toward feature space analysis,” IEEE Trans. On Pattern Analysis and Machine Intelligence, vol.24, no.5, pp. 603-619, 2002.

[5] X. Lu, “Color textile image segmentation based multiscale probabilistic reasoning,” Optical Engineering, vol.46, no.8, 087002, 2007.

[6] A. V. Oppenheim, A. S. Willsky and I. T. Young, Signals and Systems, Prentice-Hall, 1983.

[7] M.J.T. Smith and A. Docef, A Study Guide for Digital Image Processing, Scientific Publishers, Inv. Riverdale, Georgia, 1999.

[8] G. Johansson, “Visual perception of biological motion and a model for its analysis", Perception and Psychophysics, vol.14, 201-211, 1973.

[9] B. Lucas and T. Kanade, “An iterative image registration technique with an application to stereo vision,” in Proc. of International Joint Conf. On Artificial Intelligence, pp.674-679, 1981.

[10] B. Horn and B. Schunck, “Determining optical flow,” Artificial Intelligence, 17:185-203, 1981.

[11] T. Koga, K. Iinuma, A. Hirano, Y. Iijima, and T. Ishiguro, “Motion compensated interframe coding for video conferencing,” Proceedings of national Telecommunications conference, New Orleans, LA, pp.G5.3.1–G5.3.5, Dec. 1981.

[12] R. Li, B. Zeng, and M. L. Liou, “A new three-step algorithm for block motion estimation,” IEEE Trans. On Circuits and Systems for Video Technology, 4(4): 438-442, 1994.

[13] L.-M. Po and W.-C. Ma, “A novel four-step search algorithm for fast block motion estimation,” IEEE Trans. On Circuits and Systems for Video Technology, 6(3): 313-317, 1996.

[14] S. Zhu and K.-K. Ma, “A new diamond search algorithm for fast block-matching motion estimation,” IEEE Trans. On Image Processing, 9(2): 287-290, 2000.

[15] G. K. Wallace "The JPEG still picture compression standard", Communications of the ACM, 34(4):30-44, April 1991.

[16] Z. Wang, A.C. Bovik, H.R. Sheikh, and E. P. Simoncelli, “Image quality assessment: from error visibility to structural similarity,” IEEE Trans. On Image Processing, 13(4):600-612, 2004.

[17] B. E. Bayer, “An optimum method for two-level rendition of continuous-tone pictures,” in Proceedings of the IEEE International Conference on Communication, pp. 11-26, 1973.

[18] R. Floyd and L. Steinberg, “An adaptive algorithm for spatial grey scale,” Society for Information Display Symposium, Digest of Technical Papers, pp.36-37, 1975.

[19] X.-Q. Lu and H. Sakaino, “A spatial adaptive filter for smoothing of non-Gaussian texture noise” in Proc. of ICASSP, 2009.

[20] L. Rudin, S. Osher and E. Fatemi, “Nonlinear total variation based noise removal,” Physical D, 60:259-268, 1992.

[21] H. Cho, H. Lee, H. Kang, and S. Lee, ” Bilateral texture filtering,” ACM Transactions on Graphics, 33(4):1-8, 2014.

[22] R. G. Keys, “Cubic convolution interpolation for digital image processing,” IEEE Trans. On Acoustics, Speech, and Signal Processing, ASSP-29(6): 1153-1160, 1981.

[23] X. Li and M. T. Orchard, “New edge-directed interpolation,” IEEE Trans. On Image Processing, 10(10): 1521-1527, 2001.

[24] J. Canny, “A Computational Approach to Edge Detection,” IEEE Trans. Pattern Analysis and Machine Intelligence, 8(6), pp. 679-698, 1986.

[25] J.W. Cook and E.J. Delp, “Multiresolution sequential edge linking,” in Proc. IEEE Intl. Conf. Image Processing, pp.41-44, 1995.

[26] D. Comaniciu and P. Meer, "Robust analysis of feature spaces: color image segmentation," in Proc. of IEEE CVPR1997.

[27] D.S. Bolme, J.R. Beveridge, B.A. Draper, and Y.M. Lui, “Visual object tracking using adaptive correlation filters,” in Proc. of CVPR, 2010.

[28] J.F. Henriques, R. Caseiro, P. Martins, and J. Batista, “High-speed tracking with kernelized correlation filter” IEEE Trans. On Pattern Analysis and Machine Intelligence, vol.37, no.3, pp.583-596, Mar. 2015.

显示全部

1 位授课老师

陆系群

副教授

陆系群

友情链接

关注我们

关于我们

陆系群

推荐课程

【DeepSeek适用】小白玩转AI大模型应用开发

林粒粒

小白玩转 Python 数据分析

林粒粒

友情链接

关注我们

关于我们