Vulnerability Detection on VulScribeR
Evaluation Metric
F1 Score
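As a reminder of what this metric measures, F1 is the harmonic mean of precision and recall. The sketch below computes it from raw confusion counts; the function name `f1_score` and the example counts are hypothetical and not taken from the benchmark. Scaling by 100 assumes the table reports F1 as a percentage, which the page does not state explicitly.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return F1 (harmonic mean of precision and recall) from raw counts.

    tp: vulnerable samples correctly flagged
    fp: clean samples wrongly flagged as vulnerable
    fn: vulnerable samples that were missed
    """
    denom = 2 * tp + fp + fn
    if denom == 0:
        # No positive labels and no positive predictions: define F1 as 0.
        return 0.0
    return 2 * tp / denom


# Hypothetical counts, for illustration only.
score = f1_score(tp=8, fp=2, fn=2)
print(f"{100 * score:.2f}")  # prints 80.00
```

Because F1 ignores true negatives, it stays informative on heavily imbalanced vulnerability datasets, where most functions are clean.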
Evaluation Results
Performance of each model on this benchmark
Model Name | F1 Score | Paper Title | Repository |
---|---|---|---|
Devign Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | 24.99 | - | - |
Reveal Model - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | 18.98 | - | - |
LineVul - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | 16.23 | - | - |
Devign Model - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | 18.51 | - | - |
LineVul - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | 17.38 | - | - |
Reveal Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | 26.18 | Exploring RAG-based Vulnerability Augmentation with LLMs | - |