Published June 1, 2023 | Version 20230601
Dataset Restricted

CodeIPPrompt: Intellectual Property Infringement Assessment of Code Language Models

Creators

  • 1. Washington University in St. Louis

Description

This repository contains prompts generated by CodeIPPrompt, a platform used to assess potential intellectual property infringement risks associated with the output of code language models. The source code of the platform can be found at our GitHub repository: https://github.com/zh1yu4nyu/CodeIPPrompt. Detailed information regarding the datasets, as well as usage instructions, can be found in the README.md file.

The paper has been accepted by International Conference on Machine Learning (ICML) 2023. If you find this work helpful, please cite us as follows:

@inproceedings{yu2023codeipprompt,
  title={CodeIPPrompt: Intellectual Property Infringement Assessment of Code Language Models},
  author={Yu, Zhiyuan and Wu, Yuhao and Zhang, Ning and Wang, Chenguang and Vorobeychik, Yevgeniy and Xiao, Chaowei},
  booktitle={International Conference on Machine Learning},
  year={2023},
  organization={PMLR}
}

Files

Restricted

The record is publicly accessible, but files are restricted to users with access.

Request access

If you would like to request access to these files, please fill out the form below.

You need to satisfy these conditions in order for this request to be accepted:

Project website: https://sites.google.com/view/codeipprompt

This work focuses on the intellectual property of programs generated by code language models. As we are careful regarding the usage of our datasets and code, please indicate your purposes with the dataset as well as your contact information. Thank you!

You are currently not logged in. Do you have an account? Log in here

Additional details

References

  • Yu, Zhiyuan et al., "Codeipprompt: Intellectual property infringement assess- ment of code language models," in International conference on machine learning, PMLR, 2023