DPMiner

An Integrated Framework for Collecting Defect Data
by ISEL

Description

DP Miner Description

DP Miner is an Integrated Framework that can collect various types of data required for Defected Prediction. It can collect Repository List, Patch, Bug Introducing Commit (BIC), Metric.

For more information, please visit our github page, DPMinerGitHub. Thank you.

Image

Repository List

A list of repository URLs matching the conditions desired by the user is extracted from the version control system and the open source repository, GitHub. To extract the URL list, DP Miner use Search API among GitHub REST APIs. Possible conditions are as followed: commit count, recent date, fork number, language type, and author token.

Image

Patch/BIC

The patch is function to collects bug fixing commit(BFC). There are three ways to collect bug fixing commit (BFC): Jira, GitHub Issue, Commit Message. After collecting BFC (Bug Fix Commits) by the method described in Patch, BIC (Bug Introducing Commits) is collected by using SZZ algorithm. In this framework, two SZZ algorithms, B-SZZ and AG-SZZ, are used.

Image

Metric

The metric is information of source code for defect prediction. There are three representative metrics: Characteristic Vector, Bag of Words, Meta Data.






Image






Contact Us

DPMiner is designed and implemented by the Intelligent Software Engineering Lab (ISEL) at Handong Global University, Pohang, Gyeongsangbuk-do, Republic of Korea. The DPMiner engine is developed as an open source project and anybody can access the entire source code from the DPMiner GitHub repository. The DPMiner engine is fully functional as a CLI application and the DPMiner packages can be reused for other software engineering tools (Please access Javadoc here: http://isel.lifove.net/Javadoc/index.html).

If you have any issues when using our tool, please contact us (isetool@handong.edu) and report the issues here: https://github.com/ISEL-HGU/DPMiner/issues

iseltool@handong.edu