Contribute Your Purified Proteins to Accelerate Open Data in Drug Discovery
The SGC and its partners are leading a transformative effort to revolutionize small-molecule hit discovery, but we can’t do it alone! Expanding the diversity of screened protein targets is critical, and we are inviting community domain experts around the world to contribute.
As part of the Target 2035 initiative, we will develop large, openly accessible datasets of high-quality protein-small molecule binding data. Using Affinity Selection Mass Spectrometry (AS-MS) and DNA-encoded chemical libraries (DEL), we will screen 2,000 human proteins over the next five years. The resulting data will be shared openly, challenging the machine learning community to predict new, diverse small molecule binders.
And this is where you come in! We are inviting you to be part of this effort and contribute purified proteins. The resulting data will be combined with other datasets to create a large, publicly available resource hosted through AIRCHECK, our rapidly growing cloud-based Artificial Intelligence-Ready CHEmiCal Knowledge base, ensuring global accessibility. The identity of any hits will be shared with you without restriction, supporting your research and aligning with our commitment to open data and ML/AI-driven drug discovery.
Collaboration Opportunities
While the default arrangement ensures that “hits” are returned to the donating scientist and the screening data is placed into the public domain after quality checks, we offer three optional collaboration opportunities:
- Benchmarking Challenge Participation: Opt-in to allow your data to be used in benchmarking challenges, with a slight delay in the public release to ensure high-quality standards.
- Collaboration with SGC Chemistry Teams: Opt-in to collaborate directly with SGC’s chemistry teams, contributing to the development of cutting-edge small molecule discovery methods.
- Industry Collaboration: Opt-in to share your research with our industry partners, opening the door to potential collaborations with leading pharmaceutical companies.
What We Need From You
Protein Requirements
- Protein amount: 120 nmol (equals to 3 mg of a 25 kDa protein)
- Minimum concentration and aliquot size: 25 μM and 50 μL
- Purity: >90% soluble, no aggregation, homogeneous sample
- Tags:
Mandatory: His-Tag, and Biotin (either through Avi-tag or another Tag/treatment).
Optional: Protease cleavage sites
Prohibited: No Solubility tags!
Characterization / QC Data Required

Submit your Protein
Note: Before submitting, please review both forms carefully.
Steps:
- Register through the initial Protein Intake Form.
- If your protein is suitable, we will invite you to fill out the Full Registration Form.
If you have any questions, contact proteins@thesgc.org. We look forward to receiving your submissions