건대다니는 컴공생
[CoIn] 논문 리뷰 | Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints (Komatsuzaki et al., 2022)