Welcome to SoyDNGP, our webserver powered by convolutional neural network-based algorithms. This platform was developed through the meticulous analysis of genotypes and phenotypes from 5,000 randomly selected soybean accessions.
The genotype data were sourced and filtered using the SoySNP50K chip available on the SoyBase webserver. For more details on this, please visit the SoyBase SNP page here. The phenotypes for the 11 agronomy traits were downloaded from the GRIN-Global website.
If you're a first-time user of our prediction function, please take note of the following important tips:
Compatibility: To accommodate a broad spectrum of soybean genotypes, our modules were trained using a carefully selected set of 34,624 SNPs, based on the overlap of the SoySNP50K chip and resequencing data of soybean. These SNPs are built into our webserver and automatically selected when you submit your VCF files. Missing loci are filled using the reference base, which in our case is version 2 of the Williams 82 genome. Please note that if your reference is a different soybean genome version (like version 4), you'll need to convert it to version 2 to ensure accurate and reliable results.
Missing Data: If you're submitting a large number of samples (in the hundreds) for phenotype prediction, we recommend using Beagle software to phase your VCF file first. This will help you to capture as many valid SNPs as possible.
Large-scale Predictions: If you have thousands of soybean samples and wish to conduct local predictions, we've got you covered! We offer open-source code and pre-trained modules for local usage. Should you encounter any issues or questions, our dedicated team is ready and eager to assist.
File Submissions: Our server is smart! If you submit the same file within the same day, it will recognize it as a duplicate, saving you the hassle of re-submitting.We hope you enjoy your journey with SoyDNGP, and look forward to joining you in unearthing the hidden secrets of soybean genetics.
Soybean is a globally significant crop, playing a vital role in human nutrition and agriculture. Its complex genetic structure and wide trait variation, however, pose challenges for breeders and researchers aiming to optimize its yield and quality. Addressing this biological complexity requires innovative and accurate tools for trait prediction. In response to this challenge, we have developed SoyDNGP, a Convolutional Neural Networks (CNN)-based model that offers significant advancements in the field of soybean trait prediction. Compared to existing methods, such as DeepGS and DNNGP, SoyDNGP boasts a distinct advantage due to its lower parameter volume and superior predictive accuracy. Through rigorous performance comparison, including prediction accuracy and model complexity, SoyDNGP consistently outperformed its counterparts. Furthermore, it effectively predicted complex traits with remarkable precision, demonstrating robust performance across different sample sizes and trait complexities. We also tested the versatility of SoyDNGP across multiple crop species, including Cotton, Maize, Rice, and Tomato. Our results showed its consistent and comparable performance, emphasizing SoyDNGP's potential as a versatile tool for genomic prediction across a broad range of crops. To enhance its accessibility to users without extensive programming experience, we have designed a user-friendly web server, available at http://xtlab.hzau.edu.cn/SoyDNGP. The server provides two primary features: 'Trait Lookup', offering users the ability to access pre-existing trait predictions for over 500 soybean accessions, and 'Trait Prediction', allowing for the upload of VCF files for trait estimation. By providing a high-performing, accessible tool for trait prediction and genomic analysis, SoyDNGP opens up new possibilities in the quest for efficient and optimized soybean breeding.
Now we have served 3020 users.