Rafail Giavrimis
Academic and research departments
Computer Science Research Centre, Faculty of Engineering and Physical Sciences.About
My research project
ML-Based Code Optimisation of Data Intensive ApplicationsResearching methods to utilise AI and LLM based approaches to automate software engineering and code optimisation.
Supervisors
Researching methods to utilise AI and LLM based approaches to automate software engineering and code optimisation.
Publications
Developers relax restrictions on a type to reuse methods with other types. While type casts are prevalent, in weakly typed languages such as C++, they are also extremely permissive. Assignments where a source expression is cast into a new type and assigned to a target variable of the new type, can lead to software bugs if performed without care. In this paper, we propose an information-theoretic approach to identify poor implementations of explicit cast operations. Our approach measures accord between the source expression and the target variable using conditional entropy. We collect casts from 34 components of the Chromium project, which collectively account for 27MLOC and random-uniformly sample this dataset to create a manually labelled dataset of 271 casts. Information-theoretic vetting of these 271 casts achieves a peak precision of 81% and a recall of 90%. We additionally present the findings of an in-depth investigation of notable explicit casts, two of which were fixed in recent releases of the Chromium project. •Information-theoretic approach to identify poor implementations of named casts.•Detecting poor naming choices for identifiers used in a cast operation.•Measuring accord between source and target identifiers using conditional entropy.•In-depth investigation of the use of C++ explicit cast operators from Chromium.•Provide open-source implementation and dataset of 271 manually labelled casts.
Additional publications
- Rafail Giavrimis, Alexis Butler, Constantin Cezar Petrescu, Michail Basios, and Santanu Kumar Dash. 2021. Genetic Optimisation of C++ Applications. In 2021 36th IEEE/ACM International Conference on Automated Software Engineering (ASE), IEEE, 1180–1182.
https://doi.org/10.1109/ASE51524.2021.9678650