Riley LearningIMG2TEXT-Part2. OFA, CLIP Interrogator and ViTContinuing from Part 1, we are going to look into the CLIP Interrogator, OFA model, and ViT model and ensemble them. Most of the codes are…May 14, 2023May 14, 2023
Riley LearningIMG2TEXT-Part1. Background (Stable Diffusion, CLIP, Prompt)In this article, I’d like to talk about background information to implement CLIPInterrogator+OFA+ViT_LB0.568. Part 2 will cover the…May 13, 2023May 13, 2023
Riley LearningGoogle ISLR Transformer with W&B (Part 2)In this article, I’ll be showing you how to create and train a model for the Kaggle ASL (American Sign Language) recognition competition…Apr 24, 2023Apr 24, 2023
Riley LearningGoogle ASL 1. Process Data with W&B 🐝Today, I’m going to explain the dataset and how to process it for a Kaggle competition on ASL(American Sign Language), Google — Isolated…Apr 23, 2023Apr 23, 2023
Riley LearningPaper Review — Strided Transformer (TMM 2022)Strided Transformer is a monocular 3D pose estimation model which lifts a long sequence of 2D joint locations to a single 3D pose.Sep 9, 2022Sep 9, 2022
Riley LearningPaper Review — VideoPose3D (CVPR 2019)3D human pose estimation in video with temporal convolutions and semi-supervised trainingAug 23, 20221Aug 23, 20221
Riley Learning[PyTorch] Simple 3D Pose Baseline implementation (ICCV’17)In this post, I review Simple 3D Pose Baseline (A simple yet effective baseline for 3d human pose estimation, also called as SIM) which is…Aug 4, 2022Aug 4, 2022
Riley LearningHRNet : Code ExplainedHRNet(Deep High-Resolution Representation Learning for Human Pose Estimation) is a state-of-the-art algorithm in the field of semantic…Aug 4, 2022Aug 4, 2022
Riley LearningIn this post, we create a simple convolutional neural network(SimpeConvNet) using only NumPy and…Jun 26, 2022Jun 26, 2022