(2024-10) I'm on Bluesky @mcognetta.bsky.social.
(2024-09) Team π (Tyler Woodruff, Oleg Filatov, and myself) won the IEEE BigData Cup: Predicting Chess Puzzle Difficulty Challenge. The camera-ready paper and code will be available soon, and the paper will be presented at IEEE BigData.
(2024-09) My paper, Distributional Properties of Subword Regularization (with Vilém Zouhar and Naoaki Okazaki), was accepted to EMNLP. The preprint is here and the camera-ready version will be available soon.
(2024-02) My paper, Two Counterexamples to Tokenization and the Noiseless Channel, was accepted at LREC-COLING 2024. The preprint is here and the camera-ready version and code will be available soon.
(2023-07) I presented LotteryTickets.jl: Sparsify Your Flux Models at JuliaCon2023. The recording is here, and the slides and repo are here.
(2023-05) I presented Parameter-Efficient Korean Character-Level Language Modeling at EACL2023. The paper can be found here.
(2022-11) I've joined Mastodon on the sigmoid.social instance, which is focused on the ML/AI research community. My profile is @mc@sigmoid.social.
(2022-04) I've moved to Tokyo to join the Okazaki Lab for my PhD. I'll continue working part-time at Google Tokyo.
(2024-12-13) The Lichess Game Compressor's Analysis of Game 1 of the World Championships
(2024-05-16) Spending Too Much Time Optimizing LeetCode's Path With Maximum Gold
(2023-11-20) Finding a Random Seed That Solves a LeetCode Problem
(2021-12-29) Solving (and Animating) Advent of Code Day 1 with 아희
I am currently a PhD student in NLP in the Okazaki Lab at the Tokyo Institute of Technology and a PhD student researcher on the Gboard team at Google Tokyo. Prior to this, I was a software engineer at Google, also on Gboard. I did my MS in Computer Science at Yonsei University and my BS in Discrete Mathematics with a minor in Korean at Georgia Tech.
I am always open to chatting about interesting topics. Please feel free to send me an email ([lastname].[firstname]@gmail.com).
I am (not exhaustively) interested in:
Automata Theory
Scientific Computing
Languages (especially Korean and Esperanto)
Combinatorics
Open Source Software
High School Level Computer Science Education
I'm Marco, a PhD student in the Okazaki Lab at the Tokyo Institute of Technology and a developer on the Gboard team at Google. I majored in discrete mathematics with a minor in Korean at Georgia Tech, and afterwards completed a master's degree in computer science in the theory of computation lab at Yonsei University.
If you have an interesting topic in mind, I'm always happy to talk with anyone. Please get in touch by email ([lastname].[firstname]@gmail.com).
The topics I especially enjoy are:
Automata Theory
Numerical Analysis
Languages (especially Korean and Esperanto)
Combinatorics
Open Source Software
High School Level Computer Science Education
I write most of my posts in English, but to practice Korean I occasionally write posts in Korean or translate my English posts into Korean.