전체 글 53

[논문 리뷰] Video Super-Resolution With Convolutional Neural Networks

1) Paper Bibliography Title : Video Super-Resolution With Convolutional Neural Networks Authors : Armin Kappeler, Seunghwan Yoo, Qiqin Dai, and Aggelos K. Katsaggelos IEEE transactions on computational imaging June, 2016 2) Problems & Motivations Problems Most video content is not available in UHD resolution. Therefore SR algorithms are needed to generate UHD content from Full HD (FHD) (1,920 × ..

[PyTorch][에러 해결] RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [32]], which is output 0 of SelectBackward, is at version 165; expected version 164 instead. H..

CUDA 11.1 Pytorch 1.8.0 GCN Layer 사이에 Attention mechanism을 구현하는 도중 아래와 같은 에러가 발생했다. RuntimeError: one of the variables needed for gradient computation has been modified by an inplace operation: [torch.cuda.FloatTensor [32]], which is output 0 of SelectBackward, is at version 165; expected version 164 instead. Hint: enable anomaly detection to find the operation that failed to compute its gradien..

Programming/Python 2021.10.19

[구현] Block-based Translational, Affine, Perspective Motion Estimation을 직접 구현해보기 (3)

소스 코드 using System; using System.IO; namespace MEModule_Perspective { class Program { static void Main(string[] args) { ... } private static byte[,,] full_search(byte[] luma, int frame, int width, int height) { byte[,,] result = new byte[frame - 1, width, height]; ulong[] allResi = new ulong[frame - 1]; int MB = 32; // 매크로블록 사이즈 int SubMB = 16; // Sub 블록 사이즈 int SU = 16; // 매크로블록에서 위아래 얼마나 더 가져올..

[구현] Block-based Translational, Affine, Perspective Motion Estimation을 직접 구현해보기 (2)

소스 코드 using System; using System.IO; namespace MEModule_Affine { class Program { static void Main(string[] args) { ... byte[] result_stream = make_stream(full_search(luma, frame, width, height), frame, width, height); ... } private static byte[,,] full_search(byte[] luma, int frame, int width, int height) { byte[,,] result = new byte[frame - 1, width, height]; ulong[] allResi = new ulong[frame -..

[구현] Block-based Translational, Affine, Perspective Motion Estimation을 직접 구현해보기 (1)

사전 정의 소스 코드 using System; using System.IO; namespace MEModule_ORG { class Program { static void Main(string[] args) { string path = "../../../sequence/BasketballDrive_1920x1080_50.yuv"; FileStream fs = new FileStream(path, FileMode.Open); BinaryReader br = new BinaryReader(fs); //Sequence size, number of frames int width = 1920; int height = 1080; int frame = 10; int N = width * height; byte[] l..

[구현] "A Deep Convolutional Neural Network Approach for Complexity Reduction on Intra-Mode HEVC" 논문을 JEM 7.0에 구현하기 (2)

[References] Li, Tianyi, Mai Xu, and Xin Deng. "A deep convolutional neural network approach for complexity reduction on intra-mode HEVC." 2017 IEEE International Conference on Multimedia and Expo (ICME). IEEE, 2017. JEM reference software, https://jvet.hhi.fraunhofer.de/svn/svn_HMJEMSoftware/ J. Chen, E. Alshina, G. J. Sullivan, J.-R. Ohm, and J. Boyce, “Algorithm Description of Joint Explorati..

[논문 리뷰] CNN-based Motion Compensation

[References] Shuai Huo, Dong Liu, Feng Wu, Houqiang Li, “Convolutional Neural Network-Based Motion Compensation Refinement for Video Coding,” in IEEE International Symposium on Circuits and Systems (ISCAS), 2018, pp. 1–4. Zhenghui Zhao, Shiqi Wang, Shanshe Wang, Xinfeng Zhang, Siwei Ma, Jiansheng Yang, “CNN-Based Bi-Directional Motion Compensation for High Efficiency Video Coding,” in IEEE Inter..