RLHF vs. RLAIF: Fine-Tuning LLMs for Better Alignment (OTS, SFT, PPO, Jailbreak)Large Language Models (LLMs) like GPT-4, LLaMA 3, and Claude are redefining natural language processing. Despite their advancements…Jan 9Jan 9
Optimizing Azure OpenAI Service: Base Model Deployment, Fine-Tuning, and Decoding ParametersAzure OpenAI Service offers powerful tools to deploy, fine-tune, and interact with GPT models, making it essential to understand the…Dec 14, 2024Dec 14, 2024
RAG vs. Fine-Tuning : When to Use, Combine, and Optimize for Best ResultsWhen building or optimizing AI models, two powerful techniques often come into play: Fine-tuning and RAG (Retrieval-Augmented Generation)…Dec 13, 2024Dec 13, 2024
Paper Review — Debug like a Human: A Large Language Model Debugger via Verifying Runtime Execution…Debugging programs is essential yet challenging, even for advanced Large Language Models (LLMs). In their ACL 2024 paper, “Debug like a…Nov 14, 2024Nov 14, 2024
IMG2TEXT-Part2. OFA, CLIP Interrogator and ViTContinuing from Part 1, we are going to look into the CLIP Interrogator, OFA model, and ViT model and ensemble them. Most of the codes are…May 14, 2023May 14, 2023
IMG2TEXT-Part1. Background (Stable Diffusion, CLIP, Prompt)In this article, I’d like to talk about background information to implement CLIPInterrogator+OFA+ViT_LB0.568. Part 2 will cover the…May 13, 2023May 13, 2023
Google ISLR Transformer with W&B (Part 2)In this article, I’ll be showing you how to create and train a model for the Kaggle ASL (American Sign Language) recognition competition…Apr 24, 2023Apr 24, 2023
Google ASL 1. Process Data with W&B 🐝Today, I’m going to explain the dataset and how to process it for a Kaggle competition on ASL(American Sign Language), Google — Isolated…Apr 23, 2023Apr 23, 2023
Paper Review — Strided Transformer (TMM 2022)Strided Transformer is a monocular 3D pose estimation model which lifts a long sequence of 2D joint locations to a single 3D pose.Sep 9, 2022Sep 9, 2022
Paper Review — VideoPose3D (CVPR 2019)3D human pose estimation in video with temporal convolutions and semi-supervised trainingAug 23, 20221Aug 23, 20221