Junyeol Ryu

Researcher @ SNU CSE

Summary

Hello :) I'm Junyeol Ryu [Pronunciation: "june-yall" ("yeol" as in Good morning "yall" ☀️)].

I am currently a researcher at the Seoul National University. More info can be found in my CV and Research Summary.

My research interests lie in computer architectures and heterogeneous systems.

Contact: jyeol.ryu <at> gmail <dot> com

Publications

SPipe: Hybrid GPU and CPU Pipeline for Training LLMs under Memory Pressure

Preprint

Junyeol Ryu, Yujin Jeong, Daeyoung Park, Jinpyo Kim, Heehoon Kim, and Jaejin Lee

Paper
GitHub

TCCL: Discovering Better Communication Paths for PCIe GPU Clusters

ASPLOS, 2024

Heehoon Kim, Junyeol Ryu, and Jaejin Lee

Paper
GitHub

Network Contention-Aware Cluster Scheduling with Reinforcement Learning

ICPADS, 2023

Junyeol Ryu and Jeongyoon Eo

Paper
GitHub

Domestic publications

A Fast and Scalable Generative Model Inference on Distributed Multi-GPU Environment

Korean Computing Congress, 2023

Junyeol Ryu, Jinpyo Kim, and Jaejin Lee

Paper
GitHub

Investigating Contention Sensitivity of DL Training Workloads in Shared GPU Cluster

Korean Software Congress, 2022

Junyeol Ryu and Byung-Gon Chun

Paper
GitHub

Experience

Graduate Student Research Assistant

Thunder Research Group, SNU

Mar 2023 - Aug 2024

Advisor: Prof. Jaejin Lee

Built efficient software systems for deep learning. Created SPipe, a hybrid GPU-CPU pipeline for training LLMs under memory constraints that achieves average 1.26x speedup of Mobius. Participated in TCCL, a GPU collective communication library on PCIe-only GPU cluster that achieves up to 2.07x improved efficiency. Open-sourced FastGen, the step-by-step optimization of CUDA GEMM kernel that achieves 80.9% performance of closed-source cuBLAS.

Keywords:

MLSys
LLM
Training
HPC
Open Source

Graduate Student Research Assistant

Software Platform Lab, SNU

Sep 2022 - Feb 2023

Mar 2022 - Aug 2022 (Research Intern)

Advisor: Prof. Byung-Gon Chun

Focused on efficient scheduling of deep learning jobs in GPU clusters. Created two GPU cluster managers, GPack and DeepShare, which propose resouce-efficient packing using lightweight DNN and network contention mitigation using RL, respectively.

Keywords:

MLSys
Training
Cluster Management
Scheduling
Open Source

Software Engineer Intern

FriendliAI

Jan 2022 - Feb 2022

Participated in prototype web client development of Periflow, a serving engine for LLMs.

Keywords:

LLM
Inference

Software Engineer

Waffle Studio

Mar 2021 - Mar 2022

Created Guam, a team-matching mobile app that pairs programmers, project managers, and designers. Led frontend team until successful deployment on app markets and left.

Keywords:

Software Engineering
Mobile

Software Engineer

Vanilla Bridge

Jul 2020 - Dec 2020

Participated in Vanilla bridge, a dating app with emphasis on credibility by human matchmaker-based system. Focused on data-driven DevOps for service optimization by introducing data analysis with collected in-app user experience data.

Keywords:

Software Engineering
Mobile
Startup

Education

MS, Computer Science and Engineering

Seoul National University

Sep 2022 - Aug 2024
BS, Computer Science and Engineering

Summa cum laude

Seoul National University

Mar 2016 - Aug 2022

Proficiency

Languages

C++, Python
C, CUDA, OpenMP, OpenCL, MPI
Dart, JavaScript, TypeScript, Ruby

Tools and Frameworks

PyTorch
Django, Flutter, React, Vue
AWS, Firebase, BigQuery

Others

Commandline
GitHub
Open Source

Honors & Awards

Grand Prize in Samsung Computer Engineering Challenge

Fastest inference on HellaSwag with LLaMA-30B, in a server with four NVIDIA Tesla V100 GPUs and awarded $10,000 prize.

Nov 2023

Junyeol Ryu

Summary

Publications

SPipe: Hybrid GPU and CPU Pipeline for Training LLMs under Memory Pressure

TCCL: Discovering Better Communication Paths for PCIe GPU Clusters

Network Contention-Aware Cluster Scheduling with Reinforcement Learning

Domestic publications

A Fast and Scalable Generative Model Inference on Distributed Multi-GPU Environment

Investigating Contention Sensitivity of DL Training Workloads in Shared GPU Cluster

Experience

Graduate Student Research Assistant

Keywords:

Graduate Student Research Assistant

Keywords:

Software Engineer Intern

Keywords:

Software Engineer

Keywords:

Software Engineer

Keywords:

Education

Proficiency

Languages

Tools and Frameworks

Others

Honors & Awards

Teaching

Community Service

English Proficiency

Interests