Selected Publications


[NAACL] Seong-Jin Park, Youn-Gyu Jin*, Hyun-Young Moon*, Choi Bong-Hyuck, Lee Seung Hwan, Ohjoon kwon, Kang-Min Kim (* equal contributions),* ”Conflict and Overlap Classification in Construction Standards Using a Large Language Model” (Industry Track)**, **2025 Annual Conference of the North American Chapter of the Association for Computational Linguistics, TBA, 2025.

[JMIR] Seong-Ho Ahn, Kwangil Yim*, Hyun-Sik Won, Kang-Min Kim†, Dong-Hwa Jeong† (* equal contributions, † corresponding authors),* ”Discovering Time-Varying Public Interest for COVID-19 Case Prediction in South Korea Using Search Engine Queries: Infodemiology Study”, Journal of Medical Internet Research, Vol. 26, pp.1-17, December 2024. ****

[EMNLP] Jae-Woo Park, Seong-Jin Park*, Hyun-Sik Won, Kang-Min Kim (* equal contributions),* ”Large Language Models are Students at Various Levels: Zero-shot Question Difficulty Estimation”, Findings of the Association for Computational Linguistics: EMNLP 2024, pp.8157-8177, Miami, USA, November 2024. ****

[ICWS] Hyun-Sik Won, Su-Min Roh*, Dohyun Kim, Min-Ji Kim, Kang-Min Kim (* equal contributions),* ”EXTRA: Integrating External Knowledge into Multimodal Hashtag Recommendation System” (WIP), Proc. of IEEE International Conference on Web Services (ICWS), pp.719-721, Chicago, USA, July 2023.

[EACL] San-Hee Park, Kang-Min Kim*, O-Joun Lee, Youjin Kang, Jaewon Lee, Su-Min Lee, SangKeun Lee (* equal contributions),* ””Why do I feel offended?” - Korean Dataset for Offensive Language Identification”, Findings of the Association for Computational Linguistics: EACL 2023, pp.1112-1123, May 2023.

[JMIR] Junyeong Heo, Youjin Kang*, SangKeun Lee, Dong-Hwa Jeong†, Kang-Min Kim† (* equal contributions, † corresponding authors),* ”An Accurate Deep Learning-based System for Automatic Pill Identification: Model Development and Validation”, Journal of Medical Internet Research, Vol. 25, pp.1-13, January 2023. ****

[EMNLP] Mingyu Lee, Jun-Hyung Park*, Junho Kim, Kang-Min Kim, SangKeun Lee (* equal contributions),* ”Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking”, Proc. of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.7417–7427, Abu Dhabi, UAE, December 2022. ****

[ACL] Yong-Ho Jung, Jun-Hyung Park, Joon-Young Choi, Mingyu Lee, Junho Kim, Kang-Min Kim, SangKeun Lee, ”Learning from Missing Relations: Contrastive Learning with Commonsense Knowledge Graphs for Commonsense Inference”, Findings of the Association for Computational Linguistics: ACL 2022, pp.1514-1523, Dublin, Ireland, May 2022.

[EMNLP] San-Hee Park, Kang-Min Kim***,** Seonhee Cho*, Jun-Hyung Park, Hyuntae Park, Hyuna Kim, Seongwon Chung, SangKeun Lee (* equal contributions),* ”KOAS: Korean Text Offensiveness Analysis System” (Demo), Proc. of the 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp.72-78, Punta Cana, Dominican Republic, November 2021. ****

[EMNLP] Kang-Min Kim, Bumsu Hyeon, Yeachan Kim, Jun-Hyung Park, SangKeun Lee, ”Multi-pretraining for Large-scale Text Classification”, Findings of the Association for Computational Linguistics: EMNLP 2020, pp.2041-2050, November 2020. ****

[ACL] Yeachan Kim, Kang-Min Kim, SangKeun Lee. ”Adaptive Compression of Word Embeddings”, Proc. of Annual Conference of the Association for Computational Linguistics (ACL), pp.3950-3959, Seattle, USA, July 2020. ****

[BigData] Song-Eun Lee, Kang-Min Kim, Woo-Jong Ryu, Jemin Park, SangKeun Lee, “From Text Classification to Keyphrase Extraction for Short Text” (Short), Proc. of IEEE International Conference on Big Data (BigData), pp.1137-1142, Los Angeles, USA, December 2019.

[WWW] Kang-Min Kim, Yeachan Kim, Jungho Lee, Ji-Min Lee, SangKeun Lee, ”From Small-scale to Large-scale Text Classification”, Proc. of International Conference on World Wide Web (WWW), pp.853-862, San Francisco, USA, May 2019. ****

[IC] Kang-Min Kim, Woo-Jong Ryu, Jun-Hyung Park, SangKeun Lee, ”meChat: In-Device Personal Assistant for Conversational Photo Sharing”, IEEE Internet Computing, Vol.23, Issue.2, pp.23-30, March/April 2019.

[COLING] Yeachan Kim, Kang-Min Kim, Ji-Min Lee, SangKeun Lee. ”Learning to Generate Word Representations using Subword Information”, Proc. of the 27th International Conference on Computational Linguistics (COLING), pp.2551-2561, New Mexico, USA, August 2018.

[PAKDD] Kang-Min Kim, Dinara Aliyeva, Byung-Ju Choi, SangKeun Lee, ”Incorporating Word Embeddings into Open Directory Project Based Large-Scale Classification”, Proc. of the 22nd Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), pp.376-388, Melbourne, Australia, June 2018.

[MobiSys] Jung-Hyun Lee, So-Young Jun, So-Jung Park, Kang-Min Kim, SangKeun Lee, ”Demo: Mobile Contextual Advertising Platform based on Tiny Text Intelligence” (Demo), Proc. of ACM International Conference on Mobile Systems, Applications and Services (ACM MobiSys), page 181, Niagara Falls, USA, June 2017. ****