TY - GEN
T1 - CoNCRA
T2 - 34th Brazilian Symposium on Software Engineering, SBES 2020
AU - De Rezende Martins, Marcelo
AU - Gerosa, Marco Aurelio
N1 - Publisher Copyright:
© 2020 ACM.
PY - 2020/10/21
Y1 - 2020/10/21
N2 - Software developers routinely search for code using general-purpose search engines. However, these search engines cannot find code semantically unless it has an accompanying description. We propose a technique for semantic code search: A Convolutional Neural Network approach to code retrieval (CoNCRA). Our technique aims to find the code snippet that most closely matches the developer's intent, expressed in natural language. We evaluated our approach's efficacy on a dataset composed of questions and code snippets collected from Stack Overflow. Our preliminary results showed that our technique, which prioritizes local interactions (words nearby), improved the state-of-The-Art (SOTA) by 5% on average, retrieving the most relevant code snippets in the top 3 (three) positions by almost 80% of the time. Therefore, our technique is promising and can improve the efficacy of semantic code retrieval.
AB - Software developers routinely search for code using general-purpose search engines. However, these search engines cannot find code semantically unless it has an accompanying description. We propose a technique for semantic code search: A Convolutional Neural Network approach to code retrieval (CoNCRA). Our technique aims to find the code snippet that most closely matches the developer's intent, expressed in natural language. We evaluated our approach's efficacy on a dataset composed of questions and code snippets collected from Stack Overflow. Our preliminary results showed that our technique, which prioritizes local interactions (words nearby), improved the state-of-The-Art (SOTA) by 5% on average, retrieving the most relevant code snippets in the top 3 (three) positions by almost 80% of the time. Therefore, our technique is promising and can improve the efficacy of semantic code retrieval.
KW - code search
KW - joint embedding
KW - neural networks
UR - http://www.scopus.com/inward/record.url?scp=85099371218&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85099371218&partnerID=8YFLogxK
U2 - 10.1145/3422392.3422462
DO - 10.1145/3422392.3422462
M3 - Conference contribution
AN - SCOPUS:85099371218
T3 - ACM International Conference Proceeding Series
SP - 526
EP - 531
BT - Proceedings - 34th Brazilian Symposium on Software Engineering, SBES 2020
PB - Association for Computing Machinery
Y2 - 21 October 2020 through 23 October 2020
ER -