34 results for “topic:codebert”
CodeBERTScore: an automatic metric for code generation, based on BERTScore
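The core of a BERTScore-style metric is greedy matching of token embeddings between a candidate and a reference. A minimal NumPy sketch of that matching, assuming the embedding matrices have already been produced by a code model such as CodeBERT (the random vectors below are placeholders, not real model output):

```python
import numpy as np

def bertscore_f1(cand_emb, ref_emb):
    """BERTScore-style F1 from token embedding matrices.

    cand_emb: (m, d) candidate token embeddings
    ref_emb:  (n, d) reference token embeddings
    In practice these would come from a model like CodeBERT.
    """
    # Normalize rows so dot products are cosine similarities.
    c = cand_emb / np.linalg.norm(cand_emb, axis=1, keepdims=True)
    r = ref_emb / np.linalg.norm(ref_emb, axis=1, keepdims=True)
    sim = c @ r.T                       # (m, n) pairwise cosine similarity
    precision = sim.max(axis=1).mean()  # best reference match per candidate token
    recall = sim.max(axis=0).mean()     # best candidate match per reference token
    return 2 * precision * recall / (precision + recall)

# Toy usage with placeholder "embeddings"
rng = np.random.default_rng(0)
cand = rng.normal(size=(5, 8))
score = bertscore_f1(cand, cand)  # identical inputs give F1 = 1.0
```

The full metric additionally handles tokenization, importance weighting, and layer selection; this sketch shows only the similarity-matching step.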
EVIL (Exploiting software VIa natural Language) is an approach to automatically generate software exploits in assembly/Python language from descriptions in natural language. The approach leverages Neural Machine Translation (NMT) techniques and a dataset that we developed for this work.
Neural search engine for discovering semantically similar Python repositories on GitHub
Repository about small code models
Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks
🕵️‍♂️ ML project to identify malicious web payloads, aimed at boosting the effectiveness of WAFs and IDSs.
This repository contains the code, the dataset and the experimental results related to the paper "Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks" accepted for publication at The 32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024).
Code for our paper "Method-Level Bug Severity Prediction using Source Code Metrics and LLMs", accepted at ISSRE 2023.
Advanced Detection of Source Code Clones via an Ensemble of Unsupervised Similarity Measures
Improving Source Code Similarity Detection with GraphCodeBERT and Additional Feature Integration
Fine-tuning CodeBERT for Vulnerability Detection
Fine-tuning CodeBERT with AST-based Vectors for Code Translation
This repository contains experiments on comparing the similarity of Python repositories using ML models.
Performs code summarization, bug detection, and bug removal using different natural language processing models, including GraphCodeBERT, GREAT, GNN, and CoTexT.
A project for determining the similarity of Python repositories based on an embedding approach.
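A common embedding approach to repository similarity is to mean-pool per-file (or per-function) vectors into one vector per repository and compare them by cosine similarity. A minimal sketch under that assumption, with placeholder vectors standing in for real CodeBERT embeddings:

```python
import numpy as np

def repo_similarity(repo_a_embs, repo_b_embs):
    """Cosine similarity between two repositories.

    Each repo is represented by the mean of its per-file embedding
    vectors (shape: (num_files, dim)). The vectors would come from a
    code model such as CodeBERT in a real pipeline.
    """
    a = np.asarray(repo_a_embs).mean(axis=0)
    b = np.asarray(repo_b_embs).mean(axis=0)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Toy usage: a repository compared with itself scores 1.0
rng = np.random.default_rng(2)
embs = rng.normal(size=(3, 5))
same = repo_similarity(embs, embs)
```

Mean pooling is only one aggregation choice; max pooling or pairwise file matching are common alternatives.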
CodeOpt: A framework for optimizing code performance using Two-Stage Sampling, Few-Shot Learning, and Iterative Self-Reflection with support for Genetic Algorithm Inspired Chain-of-Thought (GA-COT).
CodeBERT + LoRA fine-tuning for C/C++ vulnerability detection | F1 = 74.3% | PyTorch, HuggingFace Transformers, PEFT
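The LoRA idea behind this kind of fine-tuning can be shown in plain NumPy: the pretrained weight W stays frozen, and only a small low-rank update BA is trained. This is a conceptual sketch, not the repo's PEFT-based implementation; names and shapes are illustrative:

```python
import numpy as np

def lora_forward(x, W, A, B, alpha=16):
    """Forward pass of a LoRA-adapted linear layer.

    W: (out, in) frozen pretrained weight (e.g. a CodeBERT attention
    projection). A: (r, in) and B: (out, r) are the small trainable
    matrices, with rank r << min(out, in). alpha / r scales the update.
    """
    r = A.shape[0]
    delta = B @ A                     # rank-r weight update
    return (W + (alpha / r) * delta) @ x

rng = np.random.default_rng(1)
W = rng.normal(size=(4, 6))           # frozen pretrained weight
A = rng.normal(size=(2, 6))           # trainable, randomly initialized
B = np.zeros((4, 2))                  # trainable, zero-initialized
x = rng.normal(size=6)
out = lora_forward(x, W, A, B)        # equals W @ x at initialization
```

Because B starts at zero, the adapted layer initially reproduces the pretrained model exactly; training then moves only A and B, which is why LoRA needs far fewer trainable parameters than full fine-tuning.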
The modern web development landscape is plagued by a peculiar paradox: despite the abundance of UI components and design systems, developers still spend countless hours reimplementing similar interfaces. S0 addresses this challenge by introducing a novel approach that combines advanced vector search capabilities.
Django implementation of CodeBERT for detecting vulnerable code.
AI-powered vulnerability detection for Solidity smart contracts using Mistral + CodeBERT.
Extracts business-logic code locations.
Auto-grading of C programs using machine learning and deep learning models such as Random Forest, CNN, and LSTM, and code embedding models such as CodeBERT. A paper on this work was published in IEEE (14th ICCNT Conference).
A deterministic, neuro-symbolic framework for evaluating LLM-generated code using Abstract Syntax Trees, semantic embeddings, and Integrated Gradients. Think of it as a "Digital Polygraph" for AI: a three-step verification process checks that the AI didn't "misunderstand" your instructions.
This study compares three transformer-based models: CodeT5, CodeBERT, and CodeGen.
The study uses the IRSE/FIRE dataset and explores the impact of combining original C code data with Python-derived silver-standard data.
🤖 Generate tailored AI training datasets quickly and easily, transforming your domain knowledge into essential training data for model fine-tuning.
Implementation and dataset for A Zero-Shot Framework for Cross-Project Vulnerability Detection in Source Code (Empirical Software Engineering, 2026).
CodeXGLUE, a benchmark dataset to foster machine learning research for program understanding and generation.
This repository contains the source code for the conference paper "From Bug Reports to Code Quality: A Transformer-Based Classification Approach".