SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub

Benjamin C. Carter, Jonathan Rivas Contreras, Carlos A. Llanes Villegas, Pawan Acharya, Jack Utzerath, Adonijah O. Farner, Hunter Jenkins, Dylan Johnson, Jacob Penney, Igor Steinmacher, Marco A. Gerosa, Fabio Santos

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

New contributors often struggle to find tasks that they can tackle when onboarding onto a new Open Source Software (OSS) project. One reason for this difficulty is that issue trackers lack explanations about the knowledge or skills needed to complete a given task successfully. These explanations can be complex and time-consuming to produce. Past research has partially addressed this problem by labeling issues with issue types, issue difficulty levels, and issue skills. However, current approaches are limited to a small set of labels and lack indepth details about their semantics, which may not sufficiently help contributors identify suitable issues. To surmount this limitation, this paper explores large language models (LLMs) and Random Forest (RF) to predict the multilevel skills required to solve the open issues. We introduce a novel tool, SkiliScope, which retrieves current issues from Java projects hosted on GitHub and predicts the multilevel programming skills required to resolve these issues. In a case study, we demonstrate that SkillScope could predict 217 multilevel skills for tasks with 91% precision, 88% recall, and 89% F-measure on average. Practitioners can use this tool to better delegate or choose tasks to solve in OSS projects. A demo video is available at https://youtu.be/gqU/vDcT_0o

Original languageEnglish (US)
Title of host publicationProceedings - 2025 IEEE/ACM International Workshop on Natural Language-Based Software Engineering, NLBSE 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages9-12
Number of pages4
ISBN (Electronic)9798331538644
DOIs
StatePublished - 2025
Event2025 IEEE/ACM International Workshop on Natural Language-Based Software Engineering, NLBSE 2025 - Ottawa, Canada
Duration: Apr 27 2025 → …

Publication series

NameProceedings - 2025 IEEE/ACM International Workshop on Natural Language-Based Software Engineering, NLBSE 2025

Conference

Conference2025 IEEE/ACM International Workshop on Natural Language-Based Software Engineering, NLBSE 2025
Country/TerritoryCanada
CityOttawa
Period4/27/25 → …

Keywords

  • large language models
  • machine learning
  • open source software (OSS)
  • skill categorization
  • software engineering

ASJC Scopus subject areas

  • Software
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'SkillScope: A Tool to Predict Fine-Grained Skills Needed to Solve Issues on GitHub'. Together they form a unique fingerprint.

Cite this