HyperAI超神経

Vulnerability Detection on VulScribeR

Evaluation Metric

F1 Score

Evaluation Results

Performance of each model on this benchmark

| Model | F1 Score | Paper Title | Repository |
|---|---|---|---|
| Devign Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | 24.99 | - | - |
| Reveal Model - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | 18.98 | - | - |
| LineVul - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | 16.23 | - | - |
| Devign Model - Tested on BigVul (Training on Devign + VulScribeR 20K + Extra Cleans) | 18.51 | - | - |
| LineVul - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | 17.38 | - | - |
| Reveal Model - Tested on Reveal (Training on Devign + VulScribeR 20K + Extra Cleans) | 26.18 | Exploring RAG-based Vulnerability Augmentation with LLMs | |