Genome-wide analyses reveal intricate genetic mechanisms underlying egg production efficiency in chickens

Jianlin Han

doi:10.1186/s40104-025-01245-2

Research

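Lizhi Tan¹, Xinyu Cai¹, Yuan Kong¹, Zexuan Liu¹, Zilong Wen¹, Lina Bu¹, Yiqiang Zhao^1,2, Yuzhan Wang^2,3, Xiaojun Liu⁴, Dandan Wang^4,5, Zhiwu Zhang⁶, Jianlin Han⁷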

1. State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China

2. National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, 100193, China

3. College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China

4. College of Animal Science and Technology, Henan Agricultural University, Zhengzhou, 450046, China

5. College of Animal Science and Veterinary Medicine, Henan Institute of Science and Technology, Xinxiang, 453003, China

6. Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA

7. Yazhouwan National Laboratory, Sanya, 572024, China

Corresponding author: Dandan Wang (wdd13938406174@163.com)

Authors and Affiliations

State Key Laboratory of Animal Biotech Breeding, College of Biological Sciences, China Agricultural University, Beijing, 100193, China: Lizhi Tan, Xinyu Cai, Yuan Kong, Zexuan Liu, Zilong Wen, Lina Bu & Yiqiang Zhao

College of Animal Science and Technology, China Agricultural University, Beijing, 100193, China: Yuzhan Wang

National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University, Beijing, 100193, China: Yuzhan Wang & Yiqiang Zhao

College of Animal Science and Technology, Henan Agricultural University, Zhengzhou, 450046, China: Xiaojun Liu & Dandan Wang

Department of Crop and Soil Sciences, Washington State University, Pullman, WA, USA: Zhiwu Zhang

Yazhouwan National Laboratory, Sanya, 572024, China: Jianlin Han

College of Animal Science and Veterinary Medicine, Henan Institute of Science and Technology, Xinxiang, 453003, China: Dandan Wang

Contributions

LT performed the majority of computational analysis and drafted the manuscript. LT, XC, YK, ZL and ZW performed computational analysis and explained the results. LB pre-processed genotypes. DW collected samples and phenotypes. YW provided server computing support and participated in critical discussions. XL and YZ participated in experimental design and critical discussions. ZZ and JH participated in the writing of the article. JH advised on the study design. DW and YZ conceived the study, revised the manuscript and provided overall supervision. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Dandan Wang or Yiqiang Zhao.

Abstract

Background

Compared to many other vertebrates, chickens have a high reproductive efficiency in terms of egg production. The classic traits for evaluating egg-laying performance include age at first egg, egg number, clutch size, laying rate, etc. These egg-laying traits were not specifically designed to characterize egg production efficiency and stability. By considering the stage-specific variations in the egg production curve, this study aims to investigate the genetic mechanisms that directly influence the efficiency of egg production at each stage of the laying cycle.

Results

Using whole-genome sequencing data, we performed comprehensive genome-wide association studies for 39 traits that focus on egg production efficiency and stability in the Gushi chicken. We showed that the haplotype-based approach is more effective for genetic mapping and capturing polygenic architecture. By combining association analyses with the signals of the Singleton Density Score (SDS), a population-genetic statistic designed to detect recent selection by leveraging the distribution of singletons, we found that multiple egg-laying traits related to egg production efficiency have experienced polygenic selection. Consistently, functional analysis of associated genes demonstrated that egg production efficiency benefits from multiple physiological functions. Furthermore, our results showed that the CNNM2 gene, known for its role in magnesium homeostasis, plays a dual role in egg production variance, promoting variability during the up-stage while reducing it during the sustained-stage to optimize egg production efficiency.

Conclusions

Collectively, our multiple genome analyses reveal a complex genetic mechanism underlying more efficient and stable egg production, and establish chicken genetics as a model for studying reproductive efficiency across species.

Keywords

Developmental Genetics, Egg production efficiency, Epistasis, Genetic Interaction, Genetic Variation, Genetics, Genome-wide association study, Polygenic architecture, Polygenic selection, Quantitative trait

Background

Egg production in chickens is primarily controlled and regulated by the hypothalamic-pituitary-ovary (HPO) axis, which governs the dynamics of ovulation, egg formation, and oviposition [1]. To understand the genetic basis of egg production, traditional egg-laying traits such as age at first egg, egg number, clutch size and laying rate have been widely investigated by quantitative trait loci (QTL) mapping and genome-wide association studies (GWAS) [2,3,4,5]. A large number of genetic variants have been identified as significant contributors to these traits, with the goal of enhancing egg production [6]. The egg production process is intricate and multifaceted. Beyond identifying key variants and associated genes, our recent study showed that the inter-tissue crosstalk of endocrine factors with the HPO axis plays an indispensable role in the indirect regulation of egg production [7].

Egg production throughout a hen's laying cycle can be characterized by egg production curves, which typically encompass stages from onset to peak, sustained production, and eventual decline [8]. From the perspective of the curve, a viable strategy to enhance egg production involves optimizing performance at each stage of the laying cycle [9]. This includes accelerating the increase to peak production and maintaining peak levels to delay decline, thereby improving overall egg production while extending the laying cycle. Despite considerable efforts, the intricate genetic mechanisms underlying egg production remain poorly understood from these perspectives. In addition, enhancing egg production requires thorough investigation of the genetic architecture of egg-laying traits, including their effect sizes, polygenicity and possible pleiotropy [10]. A comprehensive study of the genetic basis of complex egg-laying traits is therefore of great importance for deciphering gene-trait relationships and making informed decisions towards advancing breeding strategies [5, 11].

In this work, we extended our previous study on the genetic mechanisms underlying egg production by investigating 39 newly derived traits that focus on egg production efficiency and stability. Egg production stability here refers to reduced variance in production traits and consistent performance, while egg production efficiency in this study is defined as the hen’s ability to produce a high number of eggs consistently over the entire laying cycle. We consider egg production efficiency to encompass two key dimensions: (1) a high effective clutch intensity (ECI), which indicates strong laying persistence during a clutch period, and (2) specific patterns in the laying curve, characterized by a rapid increase to peak production during the up-stage, followed by a prolonged period of stable egg production during the sustained-stage. Through comprehensive genome-wide association studies, we identified novel genetic signals and functionally relevant associated genes, and elucidated the polygenic structure of egg-laying traits. We found that egg production variance was under positive selection during the up-stage of laying, whereas it was selected against during the sustained-stage. Consistent with this pattern, canonical correlation-based association analysis identified the magnesium transporter gene CNNM2 (Cyclin and CBS Domain Divalent Metal Cation Transport Mediator 2) as playing a dual role in egg production variance, promoting the variance in the up-stage while decreasing it in the sustained-stage. Together, these findings reveal an intricate genetic mechanism that promotes more efficient and stable egg production. Our research advances the understanding of avian reproductive biology and provides valuable insights for improving egg production in chickens.

Materials and methods

Ethics statement

All animal experimental protocols were approved by the Institutional Animal Care and Use Committee of Henan Agricultural University (protocol number 11-0085). The methods were carried out in accordance with the approved guidelines.

Sequencing and genotyping

For the resequencing data of 900 chickens, the reads obtained are of high quality with an average sequencing depth of 5.5 × [7]. The highest-quality mapping reads were calibrated using fastp with default parameters [12]. The reads were mapped against the chicken reference genome GRCg6a (GenBank accession No. GCF_000002315.6) using GTX-align with default parameters, a commercial FPGA-accelerated version of BWA [13]. Mapped reads were sorted and de-duplicated using SAMtools (V.1.3.167) [14].

The BaseVar-STITCH pipeline was used to identify polymorphic loci and impute genotypes from the mapped BAM files [15]. The pipeline consists of two steps: step 1, variant calling with BaseVar: “python BaseVar.py basetype -R Gallus6.fa --regions chr --align-file-list bam_list --output-vcf chr_vcf.gz --output-cvg chr_cvg.tsv.gz --nCPU 6 --filename-has-samplename --smart-rerun”; step 2, genotype imputation with STITCH: “Rscript STITCH.R --chr 1 --bamlist bam_list --reference Gallus6.fa --K 28 --nGen 12 --nCores 20 --niterations 40 --outputdir chr1 --tempdir tmp1 --posfile chr1_posfile”. The steps are the same as in our previous work [7]. PLINK (V1.90) [16] was then used to retain samples with a call rate > 97% and to filter out low-quality SNPs, namely SNPs with a call rate < 98%, minor allele frequency (MAF) < 0.02, or Hardy–Weinberg equilibrium P < 1 × 10^−6. Finally, 888 individuals and 13,494,226 autosomal SNPs qualified for subsequent analysis. All qualified SNPs were pruned using the parameter --indep-pairwise 50 10 0.6 in PLINK (V1.90) to obtain 3,616,394 independent SNPs for SNP-based GWAS and CCA-based GWAS analyses.

Haplotype phasing was performed using Beagle (V5.0) [17] with the parameters: beagle.jar gt=file.vcf out=file.phased gp=true. For phased data, 7,065,827 SNPs were obtained by using the parameter --indep-pairwise 50 10 0.98 in PLINK. The pruning threshold of 0.98 was chosen to filter out completely linked SNPs. The steps are the same as in our previous work [7].

Imputation of missing original egg number

A total of 888 Gushi chickens were used for the GWAS of egg-laying traits. The 12th-generation Gushi hens were obtained from the core breeding population of the Gushi Chicken Breeding Farm at Henan Sangao Agriculture and Animal Husbandry Co., Ltd., Henan Province, China. The population consisted of two batches of chickens, raised until 12 weeks of age, and then transferred to single cages in the same coop. Egg numbers were recorded daily until 43 weeks of age. Missing data from 21 to 43 weeks were imputed using the Random Forest method, implemented in the R package ‘randomForest’ with the parameters ntree = 200, maxiter = 10, mtry = 10. The Random Forest method uses the complete data as the training set. It begins by imputing average values for missing data, starting with columns that have the fewest missing data points. The process was then iterated until the parameters converged (Supplementary Fig. S14).

Egg production rate curve and derived egg-laying traits

In contrast to the conventionally delineated egg-laying period used by Wang et al. [7], we introduced a curve-fitting strategy to explore the dynamic egg-laying pattern. Three egg production rate models (Wood model [18], compartmental model [19], and Yang-Ning model [20]) were employed to fit the average egg production rate curve based on individual egg production data. The model expressions are as follows:

Wood model: $y_t=at^be^{-ct}$

Compartmental model: $y_t=a\left(1-e^{-c\left(t-d\right)}\right)e^{-bt}$

Yang-Ning model: $y_t=ae^{-bt}/(1+e^{-c\left(t-d\right)})$

where $t$ is the week of age, ${{y}}_{{t}}$ is the laying rate at week $t$, and $a$, $b$, $c$ and $d$ are the parameters to be estimated. In the Yang-Ning model, $a$ denotes the maximum potential of egg production rate, $b$ denotes the rate of decline, $c$ denotes the rate of increase, and $d$ denotes the week of age at first egg.
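As an illustration of how such curves can be fitted and compared, the following Python sketch fits the Wood and Yang-Ning models to synthetic weekly laying rates and ranks them by BIC. It is not the authors' code: the data are simulated, the starting values are arbitrary, the time index is a hypothetical week counter, and the least-squares form of BIC, $n\ln(\mathrm{RSS}/n)+k\ln n$, is an assumption because the text does not state which form was used. The compartmental model is omitted for brevity and could be added in the same way.

```python
import numpy as np
from scipy.optimize import curve_fit

def wood(t, a, b, c):
    # Wood model: y_t = a * t^b * exp(-c t)
    return a * t**b * np.exp(-c * t)

def yang_ning(t, a, b, c, d):
    # Yang-Ning model: y_t = a * exp(-b t) / (1 + exp(-c (t - d)))
    return a * np.exp(-b * t) / (1.0 + np.exp(-c * (t - d)))

def bic(y, y_hat, n_params):
    # Least-squares BIC (assumed form): n * ln(RSS / n) + k * ln(n)
    n = y.size
    rss = np.sum((y - y_hat) ** 2)
    return n * np.log(rss / n) + n_params * np.log(n)

def fit_model(model, t, y, p0):
    # Nonlinear least squares; returns fitted parameters and the BIC of the fit
    params, _ = curve_fit(model, t, y, p0=p0, maxfev=20000)
    return params, bic(y, model(t, *params), len(params))

# Synthetic mean laying rates over 23 weeks (simulated, not the study data)
rng = np.random.default_rng(0)
t = np.arange(1, 24, dtype=float)
y = yang_ning(t, 0.85, 0.015, 1.2, 2.6) + rng.normal(0.0, 0.01, t.size)

yn_params, yn_bic = fit_model(yang_ning, t, y, p0=(0.8, 0.01, 1.0, 3.0))
wood_params, wood_bic = fit_model(wood, t, y, p0=(0.5, 0.5, 0.05))
print("Yang-Ning parameters:", np.round(yn_params, 3), "BIC:", round(float(yn_bic), 1))
print("Wood parameters:", np.round(wood_params, 3), "BIC:", round(float(wood_bic), 1))
```

The model with the lowest BIC is preferred; on these simulated data the Yang-Ning model wins because the data were generated from it.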

According to the fitted curve, the egg production process was divided into three stages: up-stage (21–26 weeks), sustained-stage (27–43 weeks), and all-stage (21–43 weeks). Based on the daily egg number records, we generated a series of newly derived traits, relative to our previous study, to characterize egg-laying features. These comprise five production traits: egg production variance (EV), weekly maximum laying rate (WMLR), weekly egg production variance (WEV), bi-weekly maximum laying rate (BWMLR) and bi-weekly egg production variance (BWEV); three clutch-related traits: clutch period number (CPN), total clutch size (TCS), and effective clutch intensity (ECI); and five interval-related traits: laying interval time (LIT), average inter-laying interval (AILI), total inter-laying interval (TILI), maximum inter-laying interval (MILI), and interclutch interval (II). Supplementary Table S1 gives the definition and descriptive statistics of each derived trait. For each derived trait, a 3σ criterion was applied to remove outlier individuals to ensure the accuracy of the subsequent GWAS analyses.

The complex derived traits ECI (defined as TCS divided by CPN) and II (defined as the number of intervals divided by CPN) are calculated under the condition that each clutch includes at least two consecutive laying days:

$$ECI=\frac{{TCS}_{c\geq2}}{{CPN}_{c\geq2}}$$

$$II=\frac{\sum_{t_1}^{t_2}i}{{CPN}_{c\geq2}}$$

where ${t}_{1}$ denotes the start of the laying age (21 weeks), ${t}_{2}$ denotes the end of the laying age (43 weeks); ${c}$ represents the length of a clutch in a clutch period; and $\textit{i}$ represents the number of intervals in a clutch period [21].
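The clutch-based definitions can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: a clutch is taken to be a run of consecutive laying days of length at least two, and the "number of intervals" is assumed to be the count of non-laying days in the record. The function name `clutch_traits` and the example record are hypothetical.

```python
from itertools import groupby

def clutch_traits(daily_eggs, min_clutch=2):
    """Derive CPN, TCS, ECI and II from a daily 0/1 laying record."""
    # Collapse the record into runs of (value, run length)
    runs = [(value, len(list(group))) for value, group in groupby(daily_eggs)]
    # Clutches: runs of laying days with at least `min_clutch` consecutive days
    clutches = [length for value, length in runs if value == 1 and length >= min_clutch]
    cpn = len(clutches)   # clutch period number
    tcs = sum(clutches)   # total clutch size
    # Assumption: interval days are all non-laying days in the record
    interval_days = sum(length for value, length in runs if value == 0)
    eci = tcs / cpn if cpn else float("nan")
    ii = interval_days / cpn if cpn else float("nan")
    return {"CPN": cpn, "TCS": tcs, "ECI": eci, "II": ii}

# Hypothetical 14-day record: clutches of 3, 2 and 4 days; the single egg on day 9 is not a clutch
record = [1, 1, 1, 0, 1, 1, 0, 0, 1, 0, 1, 1, 1, 1]
print(clutch_traits(record))
```

For this record the function finds three qualifying clutches with nine eggs in total, so ECI is 3.0.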

SNP-based GWAS

Population stratification is one of the major causes for spurious phenotype-genotype associations in GWAS analysis. The projection on PC1 and PC2 showed a high overlap of individuals originating from two batches (Supplementary Fig. S15), suggesting that population stratification between batches was largely absent. Therefore, individuals from different batches were combined in the subsequent analyses. To err on the side of caution, we included the first 10 principal components (PCs) as covariates in the SNP-based GWAS analysis to control for possible confounds. The genomic relationship matrix (GRM) is a matrix that represents genetic similarity between individuals, calculated using genomic markers. It helps control for false positives by modeling the random effects of polygenic background.

After genotype imputation and quality control, 3,616,394 SNPs were obtained genome-wide. SNP-based GWAS analysis was performed for the 39 traits in a population of 843–888 chickens, depending on the trait. SNP-based GWAS for egg-laying traits was conducted using the fastGWA method [22] implemented in GCTA (V1.94.1), under the mixed linear model:

$${\varvec{y}}={\varvec{Q}}\boldsymbol{\alpha }+{\varvec{x}}{\varvec{\beta}}+{\varvec{g}}+{\varvec{e}}$$

where ${\varvec{y}}$ is the vector of egg-laying trait; ${\varvec{Q}}$ is the design matrix of covariates, including the top 10 PCs calculated from genome-wide SNPs by GCTA (V1.94.1) and hatch batch, to correct for population stratification; ${\varvec{x}}$ is the vector of genotype encoded by 0, 1, or 2; ${\varvec{g}}$ is the polygenic effect captured by the GRM calculated from genome-wide SNPs; $\boldsymbol{\alpha }$ and $\boldsymbol{\beta}$ are the vectors of corresponding effect sizes; and ${\varvec{e}}$ is the residual. Genome-wide significance was declared at a false discovery rate (FDR) $\le$ 0.05. The Manhattan and Q-Q plots of egg-laying traits were plotted using the R packages ‘ggplot2’ and ‘CMplot’ [23]. LD within GWAS signal intervals was visualized with LDBlockShow (V1.40) [24].
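The text does not say which FDR procedure was applied. Assuming the widely used Benjamini-Hochberg step-up procedure, the adjustment of GWAS P-values can be sketched as below; the function name and the four example P-values are illustrative only.

```python
import numpy as np

def benjamini_hochberg(pvals):
    """Return Benjamini-Hochberg adjusted P-values (q-values) in the input order."""
    p = np.asarray(pvals, dtype=float)
    n = p.size
    order = np.argsort(p)
    # Scale each sorted P-value by n / rank
    scaled = p[order] * n / np.arange(1, n + 1)
    # Enforce monotonicity, working down from the largest P-value
    scaled = np.minimum.accumulate(scaled[::-1])[::-1]
    q = np.empty(n)
    q[order] = np.clip(scaled, 0.0, 1.0)
    return q

pvals = [0.01, 0.04, 0.03, 0.20]
qvals = benjamini_hochberg(pvals)
significant = qvals <= 0.05   # SNPs passing FDR <= 0.05
print(np.round(qvals, 4), significant)
```

In this toy example only the first P-value remains significant after adjustment.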

Haplotype-based GWAS

The genome was divided into 1,413,153 blocks of five successive SNPs. Haplotypes within a block were retrieved, and each diploid individual was coded by the combination of its two haplotype alleles. Haplotype-based GWAS analysis was performed using the R package ‘lme4qtl’ [25]. The model can be written as:

$${\varvec{y}}={{\varvec{Q}}\boldsymbol{\alpha }+{\varvec{x}}}_{{\varvec{h}}}{\varvec{\beta}}+{{\varvec{g}}}_{{\varvec{h}}}+{\varvec{e}}$$

where ${\varvec{y}}$ is the vector of egg-laying trait, ${\varvec{Q}}$ is the design matrix of covariates, including the birth batch; ${{\varvec{x}}}_{{\varvec{h}}}$ is the haplotype combination as categorical variable; ${{\varvec{g}}}_{{\varvec{h}}}$ is the polygenic effect captured by GRM calculated from genome-wide haplotypes; $\boldsymbol{\alpha }$ is the effect for the covariates; ${\varvec{\beta}}$ is the fixed effect of haplotype combination to be tested for association; and ${\varvec{e}}$ is the residual. To enhance computational efficiency, we first constructed a null model excluding haplotype combination effects (incorporating only haplotype GRM random effects) using lme4qtl. This null model was established once in advance. Subsequently, the model residual was used as new phenotypes to build simple linear models. These models were used to test the overall statistical significance of each haplotype block.

The haplotype-based GRM was calculated with reference to method 1 described by Ferdosi et al. [26]. For each block i, we assigned a value of 1 when two haplotype alleles are equal and 0 when they are not, to obtain the haplotype relationship matrix ${\Gamma }_{i}$. The genome-wide haplotype matrix $\boldsymbol\Gamma$ is obtained as: $\boldsymbol\Gamma=\sum_{i=1}^n{\boldsymbol\Gamma}_i/n$, where n is the number of haplotype blocks in the genome. We finally converted the haplotype relationship matrix into an individual-level relationship matrix using: $\boldsymbol{H}=\boldsymbol{K}\boldsymbol{\Gamma} \boldsymbol{K}^{\prime}/2$, where $\boldsymbol{K }=\boldsymbol{ I }\otimes [1 1]$ (I is the identity matrix of $\mathit{m} \times \mathit{m}$, where $\mathit{m}$ is the number of individuals and $\otimes$ is the Kronecker product).
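The construction of $\boldsymbol{H}$ can be sketched in Python with NumPy. This is an illustrative reading of the formulas above, not the authors' code; it assumes haplotype alleles are stored as integer labels and that rows $2j$ and $2j+1$ of the input hold the two haplotypes of individual $j$, which matches $\boldsymbol{K}=\boldsymbol{I}\otimes[1\ 1]$. The explicit loop over blocks is meant for small examples only.

```python
import numpy as np

def haplotype_grm(hap_alleles):
    """hap_alleles: integer array of shape (2*m, n_blocks); rows 2j and 2j+1 belong to individual j."""
    h = np.asarray(hap_alleles)
    n_haps, n_blocks = h.shape
    m = n_haps // 2
    gamma = np.zeros((n_haps, n_haps))
    for b in range(n_blocks):
        col = h[:, b]
        # Gamma_i: 1 where two haplotype alleles are identical in block i, else 0
        gamma += (col[:, None] == col[None, :])
    gamma /= n_blocks                          # genome-wide average over blocks
    K = np.kron(np.eye(m), np.ones((1, 2)))    # maps 2m haplotypes to m individuals
    return K @ gamma @ K.T / 2.0

# Two individuals, two blocks (hypothetical allele labels)
haps = np.array([[0, 1],
                 [0, 2],
                 [0, 1],
                 [3, 2]])
H = haplotype_grm(haps)
print(H)
```

For this toy input the result is a 2 × 2 matrix with entries 1.5 and 1.0 on the diagonal and 1.0 off the diagonal.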

CCA-based GWAS

Canonical correlation analysis (CCA) uses linear combinations of variables derived from two sets of data objects to find the combination that is maximally correlated. In our study, the CCA-based GWAS used the same set of SNPs as used in the SNP-based GWAS but extended the association analysis of a single trait to multiple traits. The model can be written as:

$$\rho ({{\varvec{x}}}^{\boldsymbol{^{\prime}}},{{\varvec{y}}}^{\boldsymbol{^{\prime}}})=\frac{Cov({{\varvec{x}}}^{\boldsymbol{^{\prime}}},{{\varvec{y}}}^{\boldsymbol{^{\prime}}})}{\sqrt{Var\left({{\varvec{x}}}^{\boldsymbol{^{\prime}}}\right)}\sqrt{Var({{\varvec{y}}}^{\boldsymbol{^{\prime}}})}}$$

where $\rho ({{\varvec{x}}}^{\boldsymbol{^{\prime}}},{{\varvec{y}}}^{\boldsymbol{^{\prime}}})$ is the correlation between the projection vectors ${{\varvec{x}}}^{\boldsymbol{^{\prime}}}$ and ${{\varvec{y}}}^{\boldsymbol{^{\prime}}}$; ${\varvec{y}}$ is the vector of multiple egg-laying traits from the same laying stage (up-stage, sustained-stage, or all-stage); ${{\varvec{y}}}^{\boldsymbol{^{\prime}}}$ is the projection of ${\varvec{y}}$, denoted ${{\varvec{b}}}^{{\varvec{T}}}{\varvec{y}}$; ${\varvec{x}}$ is the vector of genotype encoded by 0, 1, or 2; ${{\varvec{x}}}^{\boldsymbol{^{\prime}}}$ is the projection of ${\varvec{x}}$, denoted ${{\varvec{a}}}^{{\varvec{T}}}{\varvec{x}}$; and ${\varvec{a}}$ and ${\varvec{b}}$ are the weight vectors chosen to maximize $\rho ({{\varvec{x}}}^{\boldsymbol{^{\prime}}},{{\varvec{y}}}^{\boldsymbol{^{\prime}}})$.

For each SNP, a P-value was estimated by calculating the correlation between genotype and multiple traits using the Chi-square test. The CCA-based GWAS also reports the canonical correlation coefficients between genotype and trait value, which can be used to measure the relative contribution of SNPs to specific traits. CCA is implemented using the ‘cancor' function in the R package ‘stats'.
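For a single SNP, the canonical correlation with a set of traits reduces to the multiple correlation between the genotype and the traits. The Python sketch below illustrates this on simulated data. The study itself used the R function ‘cancor’, and the text specifies only "the Chi-square test", so the use of Bartlett's chi-square approximation here is an assumption, as are the function name and the simulated effect size.

```python
import numpy as np
from scipy.stats import chi2

def cca_snp_test(x, Y):
    """Canonical correlation between one SNP (length n) and q traits (n x q), with a chi-square P-value."""
    x = np.asarray(x, dtype=float)
    Y = np.asarray(Y, dtype=float)
    n, q = Y.shape
    xc = x - x.mean()
    Yc = Y - Y.mean(axis=0)
    # With one genotype column, the canonical correlation is the multiple correlation of x on Y
    coef, *_ = np.linalg.lstsq(Yc, xc, rcond=None)
    fitted = Yc @ coef
    r2 = (fitted @ fitted) / (xc @ xc)
    rho = np.sqrt(r2)
    # Bartlett's approximation (assumed): -(n - 1 - (p + q + 1) / 2) * ln(1 - rho^2), df = p * q, p = 1
    stat = -(n - 1 - (1 + q + 1) / 2.0) * np.log(1.0 - r2)
    return rho, chi2.sf(stat, df=q)

# Simulated example: 500 individuals, 3 traits, SNP affects the first trait only
rng = np.random.default_rng(1)
x = rng.integers(0, 3, size=500)
Y = rng.normal(size=(500, 3))
Y[:, 0] += 0.5 * x
rho, pval = cca_snp_test(x, Y)
print(round(float(rho), 3), pval)
```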

Genome-wide prediction using haplotypes

Before estimating the effect size of haplotype alleles, we implemented a numerical dosage coding strategy. For each significant haplotype block identified by HGWAS, diploid individuals were coded by 0, 1, or 2, representing the number of copies of a specific haplotype allele present in the respective individual. The sum of codes across all haplotype alleles within a block equals two for each individual.
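The dosage coding for one haplotype block can be sketched as follows. The integer allele labels and the function name are hypothetical; real data would first map each distinct five-SNP haplotype to such a label.

```python
import numpy as np

def haplotype_dosage(hap1, hap2):
    """Code each individual by the copy number (0, 1 or 2) of every haplotype allele in a block."""
    hap1 = np.asarray(hap1)
    hap2 = np.asarray(hap2)
    alleles = np.unique(np.concatenate([hap1, hap2]))
    # One column per haplotype allele; count copies on each of the two haplotypes
    dosage = (hap1[:, None] == alleles).astype(int) + (hap2[:, None] == alleles).astype(int)
    return alleles, dosage

# Three individuals, haplotype alleles labelled 0, 1 and 2 (hypothetical)
alleles, dosage = haplotype_dosage([0, 0, 1], [0, 1, 2])
print(alleles)
print(dosage)   # each row sums to 2
```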

Two haplotype-based phenotype prediction methods are proposed. In the first method, the effect size of haplotype alleles was estimated as fixed effects. The model can be written as:

$${\varvec{y}}={\varvec{Q}}\boldsymbol{\alpha }+{\varvec{x}}{\varvec{\beta}}+{\varvec{e}}$$

where ${\varvec{y}}$ is the vector of egg-laying trait; ${\varvec{Q}}$ is the design matrix of covariates, including the birth batch; ${\varvec{x}}$ is the indicator of haplotype dosage code; $\boldsymbol{\alpha }$ is the effect vector of covariates; ${\varvec{\beta}}$ is the vector of effect sizes of haplotype alleles as fixed effects; and ${\varvec{e}}$ is the residual. Among haplotype alleles showing statistical significance (P-value < 0.05), those with a positive effect size were classified as beneficial haplotype alleles (BHA), while those with a negative effect size were classified as unbeneficial haplotype alleles (UHA). After obtaining the effect sizes of haplotype alleles, the haplotype-based polygenic prediction score (HPPS) was calculated by summing the effects of multiple haplotype alleles from all significant blocks to predict an individual's phenotype. We calculated three variants: HPPS_all for all significant haplotype blocks, HPPS_bene for only blocks comprising BHA, and HPPS_unbe for only blocks comprising UHA. The HPPS calculations were performed using the lm function in R.

The second method is very similar to the first one, except that the effect of haplotype alleles was estimated as the random effects. The model can be written as:

$${\varvec{y}}={\varvec{Q}}\boldsymbol{\alpha }+{\varvec{Z}}{\varvec{\mu}}+{\varvec{e}}$$

where ${\varvec{Z}}$ is the indicator of haplotype dosage code; ${\varvec{\mu}}$ is the vector of effects of haplotype alleles as random effects; and ${\varvec{e}}$ is the residual. We used the random effect estimates for all haplotype alleles and termed the resulting prediction Haplotype-based Best Linear Unbiased Prediction (HBLUP). The HBLUP calculations were performed using the R package ‘hglm’.

To evaluate the accuracy of genomic prediction, we implemented a constant validation set approach, wherein a fixed subset of individuals was consistently excluded from model training. Specifically, we randomly selected 100 individuals to constitute a fixed validation set. The model's accuracy was then evaluated exclusively using this constant validation set within the 10-fold cross-validation (CV) framework. The Pearson correlation coefficient was used to quantify the correlation between predicted and observed phenotypes. This process was repeated 50 times, and we reported the mean correlation coefficient as the measure of prediction accuracy.

Analysis on allele-stage interactions

To investigate the interaction between genotype and stage on egg-laying traits, we employed a GLM with an interaction term. In this model, the stage of egg production (up-stage or sustained-stage) was treated as one fixed effect, while the genotype was treated as another fixed effect. The model can be written as:

$${\varvec{y}}={\varvec{Q}}\boldsymbol{\alpha }+{\varvec{x}}{\varvec{\beta}}+{\varvec{\gamma}}\left({\varvec{Q}}\times {\varvec{x}}\right)+{\varvec{e}}$$

where ${\varvec{y}}$ is the vector of egg-laying trait; ${\varvec{Q}}$ is the vector of stage as factors (up-stage/sustained-stage); ${\varvec{x}}$ is the vector of genotype as factors (aa/Aa/AA); $\boldsymbol{\alpha }$ is the effect vector of stage; ${\varvec{\beta}}$ is the effect vector of each SNP; ${\varvec{\gamma}}$ is the interaction effect between stage and genotype; and ${\varvec{e}}$ is the residual. A post-hoc analysis was subsequently conducted to identify the specific level of interaction between stage and genotype, when ${\varvec{\gamma}}$ tests significant. The post-hoc test was implemented by the R package ‘emmeans'.

Proportion of variance explained by genetic loci

The proportion of variance explained (PVE) by SNP or haplotype was calculated following the method proposed by Gu et al. [27]. A random model was used to fit both significant loci and insignificant loci. The model can be written as:

$${\varvec{y}}={{\varvec{Z}}}_{1}{{\varvec{\mu}}}_{1}+{{\varvec{Z}}}_{2}{{\varvec{\mu}}}_{2}+{\varvec{e}}$$

where ${\varvec{y}}$ is the vector of egg-laying trait; ${{\varvec{Z}}}_{1}$ and ${{\varvec{Z}}}_{2}$ are design matrices of genotype encoded by 0, 1, or 2; ${{\varvec{\mu}}}_{1}$ is the vector of the genetic effect for significant loci, and ${{\varvec{\mu}}}_{2}$ is the vector of the genetic effect for non-significant loci, with ${{\varvec{\mu}}}_{i}\sim \text{N}(0,\ {{\varvec{K}}}_{{\varvec{i}}}{\sigma }_{{{\varvec{\mu}}}_{{\varvec{i}}}}^{2})$, where ${{\varvec{K}}}_{{\varvec{i}}}$ denotes the GRM constructed from the significant loci (i = 1) and non-significant loci (i = 2), and ${\sigma }_{{{\varvec{\mu}}}_{{\varvec{i}}}}^{2}$ denotes the genetic variance corresponding to the random effect; and ${\varvec{e}}$ is the residual. The PVE of significant loci from SNP-based GWAS or haplotype-based GWAS can be expressed as $\frac{{\sigma }_{{{\varvec{\mu}}}_{1}}^{2}}{{\sigma }_{{{\varvec{\mu}}}_{1}}^{2}+ {\sigma }_{{{\varvec{\mu}}}_{2}}^{2}+ {\sigma }_{e}^{2}}$. The genome-wide PVE from SNP-based GWAS and haplotype-based GWAS can be expressed as $\frac{{\sigma }_{{{\varvec{\mu}}}_{1}}^{2}+ {\sigma }_{{{\varvec{\mu}}}_{2}}^{2}}{{\sigma }_{{{\varvec{\mu}}}_{1}}^{2}+ {\sigma }_{{{\varvec{\mu}}}_{2}}^{2}+ {\sigma }_{e}^{2}}$. The variance components were estimated by the EMREML approach using the “Single trait model” in HIBLUP (V1.4) [28].

Enriched score for beneficial and unbeneficial haplotype allele in Chinese chicken populations

For significant haplotype blocks identified by HGWAS across different traits, we traced the origin and examined the frequency of both BHA and UHA in 39 local chicken populations in China. Due to discrepancies between the two SNP sets, we filled missing loci in the 39 local chicken populations with the allele from the reference genome to make the haplotype alleles directly comparable. We defined an enriched score to measure the extent of enrichment of BHA and UHA in local chicken populations as:

$${E}_{k}= \sum_{i= 1}^{n}{f}_{ik}$$

where ${E}_{k}$ denotes the overall enriched score of a trait in the $k$th population; $n$ denotes the number of BHA and UHA for this trait; and ${f}_{ik}$ denotes the frequency of the $i$th haplotype allele in the $k$th population.

Historical effective population size estimation

Historical effective population size (Ne) was estimated from genome-wide SNPs by GONe software V1.0 using default parameters [29]. We ran GONe assuming a recombination rate of 300 kb per centimorgan and a generation time of one year using all 888 samples. As recommended, the GONe estimates are most reliable for the most recent 100–200 generations.

Calculation of iHS, Tajima's D and π values

The integrated haplotype score (iHS) is a haplotype-based method for detecting positive selection, specifically focusing on strong hard sweeps. The iHS was calculated for the focal SNP by considering the extended haplotypes surrounding it, using the ihsbin program in hapbin software V1.3.0 [30] with the parameters: --minmaf 0.01 and --cutoff 0.01. The iHS values were Z-score standardized and then converted into P-values using the standard normal distribution. We applied Bonferroni correction at the genome-wide level to adjust these P-values.
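The standardization and multiple-testing steps described above can be sketched in Python. This is an illustration only: it assumes a single genome-wide standardization and a two-sided test, neither of which is specified in the text, and the input values are simulated rather than taken from hapbin output.

```python
import numpy as np
from scipy.stats import norm

def ihs_to_pvalues(ihs, alpha=0.05):
    """Z-standardize iHS values, convert to P-values and flag Bonferroni-significant SNPs."""
    ihs = np.asarray(ihs, dtype=float)
    z = (ihs - ihs.mean()) / ihs.std(ddof=1)
    # Two-sided P-value from the standard normal distribution (assumption)
    p = 2.0 * norm.sf(np.abs(z))
    # Bonferroni threshold at the genome-wide level: alpha divided by the number of tests
    threshold = alpha / ihs.size
    return z, p, p < threshold

# Simulated unstandardized scores with one extreme value appended
rng = np.random.default_rng(3)
scores = np.append(rng.normal(0.2, 1.0, size=9999), 8.0)
z, p, is_significant = ihs_to_pvalues(scores)
print(int(is_significant.sum()), "SNP(s) pass the Bonferroni threshold")
```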

We divided the genome into non-overlapping 100-kb windows. Tajima's D and π (nucleotide diversity) values were calculated for each window using VCFtools v0.1.17 [31] with the parameters --TajimaD 100000 and --window-pi 100000, respectively.

SDS for recent polygenic selection

The singleton density score (SDS) is a statistical method for detecting recent selection based on the density of singletons. Under the assumption that haplotypes carrying the selected allele tend to carry fewer singleton mutations, this method models the changes in genealogical tip-branch lengths, calculated from distance between the nearest singletons between the derived and ancestral alleles.

We calculated SDS using the script provided by Field et al. [32]. The recent Ne was set to 20,436 according to the GONe estimation. We estimated gamma-shapes for each derived allele frequency (DAF) bin, with a bin width of 0.005 and DAF ranging from 0.05 to 0.95. As no significant difference was found between ancestral alleles (AAs) and derived alleles (DAs) in SDS calculation, we compared reference alleles with alternative alleles instead. Thus, SNPs with SDS > 0 indicate that the alternative allele was positively selected, and vice versa [33]. The raw SDS scores (rSDS) were converted to P-values using the Z-score approach, similar to the method applied for iHS. We identified potential selection regions based on high Z-scores of mean SDS values (mSDS) in non-overlapping genomic windows of 50 kb.

Associating tSDS with GWAS summary statistics

Trait-SDS (tSDS) values were generated by polarizing the SDS scores such that a positive tSDS indicates an increased frequency of the allele favoring the trait. All SNPs with assigned tSDS were ranked by their GWAS −log₁₀(P) in ascending order and grouped into consecutive bins of 1,000 SNPs each. For each of the 39 egg-laying traits, we calculated Spearman correlation coefficients between the rank of the bins and the average tSDS within each bin. A positive correlation suggests that the polygenic trait was favored by selection, while a negative correlation indicates that it was not favored by selection.
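The binning and correlation steps can be sketched in Python as follows. This is a minimal illustration on simulated data, not the authors' pipeline. It assumes that SDS and the GWAS effect size refer to the same (alternative) allele, so that polarization is a sign flip by the effect direction, and that any SNPs left over after forming full bins are dropped from the least significant end.

```python
import numpy as np
from scipy.stats import spearmanr

def tsds_bin_correlation(sds, beta, pvals, bin_size=1000):
    """Spearman correlation between GWAS-significance bin rank and mean tSDS per bin."""
    sds = np.asarray(sds, dtype=float)
    beta = np.asarray(beta, dtype=float)
    pvals = np.asarray(pvals, dtype=float)
    # Polarize: positive tSDS means the trait-increasing allele rose in frequency (assumed convention)
    tsds = sds * np.sign(beta)
    # Rank SNPs by -log10(P) in ascending order
    order = np.argsort(-np.log10(pvals))
    n_bins = order.size // bin_size
    # Drop the remainder from the least significant end so every bin holds bin_size SNPs
    order = order[order.size - n_bins * bin_size:]
    bin_means = tsds[order].reshape(n_bins, bin_size).mean(axis=1)
    rho, p = spearmanr(np.arange(n_bins), bin_means)
    return rho, p

# Simulated summary statistics in which more significant SNPs carry larger tSDS
rng = np.random.default_rng(4)
n_snps = 20000
pvals = rng.uniform(1e-8, 1.0, size=n_snps)
beta = rng.normal(size=n_snps)
sds = np.sign(beta) * (-np.log10(pvals)) + rng.normal(0.0, 0.5, size=n_snps)
rho_bins, p_bins = tsds_bin_correlation(sds, beta, pvals)
print(round(float(rho_bins), 3), p_bins)
```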

Multi-tissue transcriptome profiling

A total of 78 RNA sequencing samples (14 from the hypothalamus and 16 each from pituitary, ovary, liver, and abdominal fat tissue) from 43-week-old high-yielding (n = 8) and low-yielding (n = 8) chickens were retrieved from Wang et al. [7]. Multi-tissue transcriptome profiling with HISAT2 [34] and StringTie [35] followed the same procedures as in our previous work [7].

Gene annotation, functional analysis and prioritization

To identify candidate genes that may affect chicken egg production, genes overlapping with GWAS significant loci were extracted according to the Ensembl genome annotation. Chicken QTL information from Chicken QTLdb (https://www.animalgenome.org/cgi-bin/QTLdb/GG/index) was used to match the candidate genes functionally related to egg-laying traits. GO enrichment analysis was performed using the Metascape web service (https://metascape.org) [36]. Due to data availability constraints, the enrichment analysis was based on the mouse orthologs of the chicken genes.

Gene prioritization of candidate genes based on functional similarity was conducted using the ToppGene web service (https://toppgene.cchmc.org/) [37]. A total of 291 genes associated with human reproduction from the GWAS catalog (https://www.ebi.ac.uk/gwas/) [38] were used as the training set, and genes retrieved from SNP-based GWAS and haplotype-based GWAS were used as candidates to prioritize.

To reveal the biological context of the identified loci, a comprehensive biological network was constructed for CNNM2 and the top 20 candidate genes using the web version of GeneMANIA (https://genemania.org/) [39]. GeneMANIA integrates data on protein and genetic interactions, pathways, co-expression, co-localization, and protein structural domain similarity to construct biological networks and identify related genes in common pathways. The resulting networks were visualized using Cytoscape (V3.8.2) [40].

Results

Generating derived egg-laying traits

We considered the egg-laying behavior of chickens as a continuous and dynamic process from the start of egg-laying to the end of production. Multiple models were employed to fit the mean of the egg production rate, including the Wood model (BIC = −114.313), the compartmental model (BIC = −146.244), and the Yang-Ning model (BIC = −160.423) (Fig. 1a). The Yang-Ning model provided the best fit to the data (lowest BIC), and its four parameters have specific biological interpretations: the maximum potential of egg production rate (a = 0.837), the rate of decline in egg production (b = 0.015), the rate of increase in egg production (c = 1.182), and the week of age at first egg (d = 2.621). Together, these parameters provide a plausible characterization of the biological features of egg production.

According to the fitted curve of egg production rate, the stationary point of the mean curve in the chicken population was found to be at 26 weeks of age ($f'(26.3) = 0$). We, therefore, divided the egg-laying process into three stages: the up-stage (21–26 weeks), the sustained-stage (27–43 weeks), and the all-stage (21–43 weeks). Classifying these stages helps calibrate our understanding of the dynamics and underlying mechanisms of egg production throughout the laying cycle. Subsequent data analyses were conducted separately for the up-, sustained-, and all-stages. For the imputed egg production data, 13 derived traits related to production (5 traits), clutch (3 traits), and interval (5 traits) were constructed focusing on egg production efficiency and stability from the three egg-laying stages, totaling 39 traits (Fig. 1b, c, Supplementary Fig. S1, and Supplementary Table S1). Although some traits exhibited moderate to high phenotypic correlations (Fig. 1d and Supplementary Fig. S1), this does not necessarily indicate strong genetic correlation. Moreover, each trait has distinct biological significance. Therefore, all derived traits at each stage were retained for subsequent analyses.
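The stationary point can be checked from the fitted parameters with a short sketch. Two things here are assumptions rather than statements from the text: the functional form, taken to be the commonly cited Yang-Ning curve $y = a e^{-bx} / (1 + e^{-c(x-d)})$, and the time origin, taken so that model time $x$ equals age in weeks minus 20. Under those assumptions the computed peak falls at about 26.3 weeks of age, in line with the value reported above; a different parameterization would give different numbers.

```python
import math

# Fitted parameters reported in the text.
A, B, C, D = 0.837, 0.015, 1.182, 2.621
AGE_OFFSET = 20  # assumed: model time x = age in weeks - 20

def yang_ning(x, a=A, b=B, c=C, d=D):
    """Assumed Yang-Ning form: logistic rise multiplied by exponential decline."""
    return a * math.exp(-b * x) / (1.0 + math.exp(-c * (x - d)))

def stationary_point(b=B, c=C, d=D):
    """Solve f'(x) = 0 analytically.

    With u = exp(-c (x - d)), d/dx log f = -b + c * u / (1 + u),
    so f'(x) = 0 gives u = b / (c - b) and x = d - ln(u) / c.
    """
    u = b / (c - b)
    return d - math.log(u) / c

x_star = stationary_point()        # peak in model time
age_star = x_star + AGE_OFFSET     # peak in weeks of age (under the assumed offset)
```

Because the logarithmic derivative is monotonically decreasing in $x$, this stationary point is the unique maximum of the curve, which is what justifies using it as the boundary between the up-stage and the sustained-stage.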

Genetic mapping for egg production efficiency

The polygenic nature of complex traits is widely acknowledged. Across the three egg-laying stages, GWAS for most traits revealed no genome-wide significant signals (Supplementary Fig. S2). However, for the up-ECI (effective clutch intensity) trait, eight significant loci were identified on GGA1, GGA5, and GGA18 (Fig. 2a, Table 1). In the region spanning from 126.4 Mb to 126.5 Mb on GGA1, three SNPs, rs13935264 (P = 1.075e-07), rs13935293 (P = 5.841e-09) and rs739946136 (P = 4.321e-08), were annotated to the gene SHROOM2. In the region from 46.1 Mb to 46.4 Mb on GGA5, rs317474373 (P = 6.657e-10) and rs317620935 (P = 1.524e-08) were annotated to the gene SYNE3. The genomic region comprising the significant loci on GGA5 exhibited strong linkage disequilibrium (LD), suggesting high reliability of the associated signals (Fig. 2b). As an example, the three genotypes of rs316710646 showed ordinal changes in the up-ECI trait, i.e., $\text{GG}_{\text{up-ECI}} > \text{GA}_{\text{up-ECI}} > \text{AA}_{\text{up-ECI}}$, suggesting that the G allele is beneficial for the trait (Fig. 2c). In addition, a significant SNP rs316748724 (P = 2.460e-08), located around 4.3 Mb on GGA18, was annotated to the gene CYGB. Some significant signals were also detected for the sustained-TILI (total inter-laying interval) trait (Supplementary Fig. S3, Table 1). Three significant signals were located between 30.5 Mb and 30.7 Mb on GGA7, with the most significant SNP, GGA7_30617934 (P = 3.729e-09), annotated to the gene R3HDM1. Another significant SNP on GGA7, rs794029118 (P = 2.841e-08), was annotated to the gene ZRANB3. Moreover, rs1058884213 (P = 1.066e-08), around 38.2 Mb on GGA5, was annotated to the gene LTBP2.

Table 1 Summary statistics for significant SNPs in SNP-based GWAS


Haplotype analysis may provide more power to detect associations as haplotypes can capture the combined effects of multiple linked loci [41]. Unlike the conventional additive encoding of SNPs, haplotypes are typically encoded as categorical variables. Genome-wide haplotype GWAS (HGWAS) was performed using non-overlapping windows of five SNPs across all 7,065,827 SNPs after phasing and pruning. HGWAS detected many association signals that had not shown significance in SNP-based GWAS. For the up-ECI trait, significant haplotype signals occurred predominantly on GGA1, GGA4, GGA5, and GGA14 (Fig. 2d, Supplementary Table S2). New significant loci were found for the all-TILI trait on GGA18 (Supplementary Fig. S4, Supplementary Table S3). HGWAS also revealed significant loci for the all-WEV (weekly egg production variance) trait and the up-MILI (maximum inter-laying interval) trait on GGA4, which were undetected in SNP-based GWAS (Supplementary Fig. S4, Supplementary Tables S4 and S5).

As HGWAS only reports significant haplotype blocks, we employed separate general linear models (GLM) to estimate the effect size of each haplotype allele within these blocks, utilizing haplotype allele dosage coding. Beneficial haplotype alleles (BHA) and unbeneficial haplotype alleles (UHA) were classified based on their positive or negative effects on desirable egg-laying traits. For instance, for the up-ECI trait, 262 significant blocks were identified genome-wide, comprising 140 blocks containing BHA, 86 blocks containing UHA, and 10 blocks containing both. Lists of BHA and UHA for all traits with significant HGWAS signals are provided in Supplementary Table S6.
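The haplotype allele dosage coding mentioned above can be sketched as follows: each phased haplotype is cut into non-overlapping five-SNP windows, and for a given block each diploid individual receives a count (0, 1, or 2) for every haplotype allele observed in that block. The function names and the string representation of phased haplotypes are assumptions for illustration; the resulting dosage columns are what a linear model would take as predictors.

```python
from collections import Counter

def haplotype_blocks(hap, window=5):
    """Cut one phased haplotype (one character per SNP) into non-overlapping windows.

    A trailing incomplete window is dropped.
    """
    return [hap[i:i + window] for i in range(0, len(hap) - window + 1, window)]

def dosage_matrix(individuals, block_index, window=5):
    """Dosage coding of haplotype alleles for one block.

    individuals: list of (hap1, hap2) phased strings, one pair per diploid individual.
    Returns (alleles, rows), where rows[i][j] is the number of copies of
    alleles[j] carried by individual i.
    """
    per_individual = []
    for hap1, hap2 in individuals:
        allele1 = haplotype_blocks(hap1, window)[block_index]
        allele2 = haplotype_blocks(hap2, window)[block_index]
        per_individual.append(Counter([allele1, allele2]))
    alleles = sorted(set().union(*per_individual))
    rows = [[counts[a] for a in alleles] for counts in per_individual]
    return alleles, rows

# Hypothetical toy data: two individuals, two five-SNP blocks each.
toy = [
    ("AGGGGTTTTT", "AGGGATTTTT"),  # heterozygous in block 0
    ("AGGGGCCCCC", "AGGGGTTTTT"),  # homozygous in block 0
]
alleles, rows = dosage_matrix(toy, block_index=0)
```

Each row sums to 2, so in a regression one allele column is redundant with the intercept; dropping a reference allele or fitting without an intercept are the usual ways to handle this, and the text does not say which was used.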

Both SNP-based GWAS and HGWAS test each locus independently. Multiple loci associated with the same trait can be incorporated into one model simultaneously to estimate the phenotypic variation they jointly explain. For significant loci identified by SNP-based GWAS, a genomic relationship matrix (GRM) was constructed for each trait using SNPs from significant loci, and the corresponding proportion of variance explained (PVE) was estimated by fitting the GRM as a random effect. Similarly, haplotype alleles from the significant haplotype blocks identified by HGWAS were used to construct a haplotype-based GRM, and the corresponding PVE was calculated. For the up-ECI trait, the PVE from HGWAS was 38.7%, a more than three-fold improvement over the 11.2% PVE from SNP-based GWAS. For all traits with significant GWAS signals, PVE estimates from significant loci identified by SNP-based and haplotype-based GWAS are provided in Table 2.

Table 2 Significant loci PVE and genome-wide PVE in GWAS


Polygenic architecture for egg-laying traits

The large number of genome-wide HGWAS signals indicated the polygenic nature of the egg-laying traits. Despite the significant signals reported, many loci with small effects remained unidentified under the GWAS cut-off. To account for the contributions of these loci, a model with two random effects, one for significant loci and one for non-significant loci, was constructed. The model for haplotypes explained 42% of the phenotypic variance for the up-ECI trait, while the model for SNPs accounted for only 25.5%. For all traits with significant GWAS signals, genome-wide PVE estimates by SNPs and haplotypes are provided in Table 2. For polygenic traits, loci with small effects are distributed across chromosomes. Consequently, the PVE of each chromosome is expected to correlate positively with its length. For the up-ECI trait, a strong positive correlation (r = 0.602) was indeed observed based on haplotypes, although some intermediate and small chromosomes exhibited disproportionately large PVEs relative to their length (Fig. 3a). We also examined the number of BHA and UHA within each significant haplotype block associated with egg-laying traits. Significant haplotype blocks contained an average of 1.38 BHA and 2.99 UHA, highlighting the multi-allelic architecture of the egg-laying traits (Fig. 3b).

The polygenic architecture of the trait enables phenotype prediction using whole-genome data. For each HGWAS significant block, the effect size of each haplotype allele was first estimated as a fixed effect using GLM. On this basis, the effect sizes across all significant blocks were summed for each diploid individual and termed the haplotype-based polygenic prediction score (HPPS). Likewise, in a separate model, the haplotype allele effect was estimated as a random effect, referred to as haplotype-based best linear unbiased prediction (HBLUP). The genomic best linear unbiased prediction (GBLUP) method, widely used in animal breeding, was also employed. The GBLUP approach, which does not require estimation of individual marker effects, was implemented using GRMs constructed from either genome-wide SNPs or haplotypes (Fig. 3c). Predictive performance was assessed by correlating the predictions with the original phenotype values. To reduce the risk of overfitting, evaluations were conducted against the constant validation set. Taking the up-ECI trait as an example, based on 50 iterations of tenfold cross-validation, the traditional GBLUP method (R = 0.246) and GBLUP on the haplotype GRM (R = 0.253) did not perform very well. Notably, HBLUP (R = 0.295) was slightly more accurate than HPPS_all (R = 0.289), which uses all haplotype alleles from the significant blocks. The HPPS using only BHA (R = 0.267) or UHA (R = 0.249) showed moderate performance. Combining BHA and UHA proved superior to using either alone, implying that including unbeneficial alleles improves phenotype prediction.
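The HPPS described above amounts to summing, over all significant blocks, the estimated effects of the two haplotype alleles an individual carries. A minimal sketch follows; the function name, the data layout, the rule that alleles without an estimate contribute zero, and the use of the effect sign to separate BHA from UHA are assumptions for illustration. The effect sizes in the toy example are the values reported in this study for two up-ECI blocks; the individual is hypothetical.

```python
def hpps(individual_blocks, effects, keep=lambda effect: True):
    """Haplotype-based polygenic prediction score for one diploid individual.

    individual_blocks: list over significant blocks of (allele1, allele2).
    effects: list over the same blocks of {haplotype allele: effect size}.
    keep: filter on effect sizes, e.g. only positive effects for a BHA-only score.
    Alleles with no estimated effect contribute 0 (assumption).
    """
    score = 0.0
    for (allele1, allele2), block_effects in zip(individual_blocks, effects):
        for allele in (allele1, allele2):
            effect = block_effects.get(allele, 0.0)
            if keep(effect):
                score += effect
    return score

# Effect sizes quoted in the text for two up-ECI blocks.
effects = [
    {"AGGGG": 0.337, "AGGGA": 5.788},
    {"GACAC": 2.0, "GGAGT": 0.243},
]
# Hypothetical individual: heterozygous in the first block, homozygous in the second.
individual = [("AGGGG", "AGGGA"), ("GGAGT", "GGAGT")]

score_all = hpps(individual, effects)
score_bha = hpps(individual, effects, keep=lambda e: e > 0)
```

In a cross-validation setting the effect dictionaries would be estimated on the training folds only and the scores computed for held-out individuals, so that the reported correlation is not inflated by reusing the same animals for estimation and evaluation.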

Tracking trait-associated haplotype alleles in chicken populations

As shown in Fig. 4a and Supplementary Fig. S5, numerous haplotype alleles across the genome had effects on the up-ECI trait. By comparing sequences between haplotype alleles within a haplotype block, we found that mutations on some haplotype alleles significantly altered the strength of association with the trait. For example, in haplotype block No. 173 (genomic position GGA5: 46,764,467–46,765,093 bp), which showed significant association with the up-ECI trait, a G > A mutation changed the haplotype allele “AGGGG” to “AGGGA”. Although the frequency of the derived haplotype allele “AGGGA” was low (f = 0.001), the effect size increased from 0.337 (“AGGGG”) to 5.788 (“AGGGA”) (Fig. 4b), suggesting that this BHA may have arisen recently. In contrast, a BHA “GACAC” with a low frequency (f = 0.037) but a high effect size of 2.0 within haplotype block No. 247 (genomic location GGA18: 4,206,126–4,206,301 bp), corresponding to gene MGAT5B, had limited identity with the dominant haplotype allele “GGAGT” (f = 0.856), which had an effect size of only 0.243 (Supplementary Fig. S6). Surprisingly, the BHA “GACAC” had the highest frequency (45%) in the Red Jungle Fowl (RJF) and showed a trend of decreasing frequency from the South to the North in domestic chickens (Supplementary Fig. S7). A functional survey revealed that MGAT5B promotes egg production by up-regulating the expression of the egg-laying related genes AKR1D1 and EDA2R [42]. However, MGAT5B has also been implicated in neurological disorders, where its deficiency led to decreased astrocyte activation and enhanced oligodendrocyte maturation [43]. We suspect that MGAT5B has been subject to negative selection in local chicken populations, resulting in a trade-off that sacrifices some egg yield to potentially reduce the incidence of neurological disorders.

We further examined the frequencies of BHA and UHA identified in all egg-laying traits of the Gushi chicken among 39 Chinese local chicken populations (Supplementary Table S7). The Ningdu Yellow chicken (NDY) was found to have the highest sharing rate of BHA ($E_{\text{NDY}}^{\text{BHA}} = 16.015$) for the up-ECI trait and the highest sharing rate of UHA ($E_{\text{NDY}}^{\text{UHA}} = 12.925$) for the up-MILI trait (Fig. 4c and d). This suggests substantial genetic exchange between the NDY and the Gushi chicken, which is not surprising given that the two breeds are distributed in close geographic proximity. Interestingly, the Beijing-You chicken (BJY) in northern China exhibited a high degree of shared haplotype alleles ($E_{\text{BJY}}^{\text{BHA}} = 1.947$) associated with increased phenotypic variance in the all-WEV HGWAS. In contrast, the two southern chicken populations (Huaibei Partridge chicken (HBP) and Dulong chicken (DL)) showed a high degree of shared haplotype alleles ($E_{\text{HBP}}^{\text{UHA}} = 1.513$, $E_{\text{DL}}^{\text{UHA}} = 1.438$) that decrease phenotypic variance (Supplementary Fig. S8). The lower egg production variance of southern Chinese chickens may be attributed to the uniform sunshine and temperature in southern China, which create suitable conditions for year-round egg production. Conversely, northern Chinese chickens exhibited relatively high egg production variance, potentially reflecting adaptation to the greater environmental variability in the North.

Polygenic selection on egg production efficiency

Domesticated chickens are subjected to both natural and artificial selection. Given the high polygenicity of egg-laying traits, the response to selection would differ from that of traits influenced by a few high-impact loci. As expected, only a few significant selective signals on GGA6 were detected by the iHS approach (Supplementary Fig. S9). The haplotype-based H12 statistic yielded similar results and showed no signal of purifying selection (Supplementary Fig. S10). However, Tajima's D scores ranged between 2 and 4 across most genomic regions, indicating a significant deviation from the neutral expectations ($P_{t\text{-test}(\text{Tajima's } D = 0)} < 0.01$) (Supplementary Fig. S11).

To better quantify selection for complex traits, the singleton density score (SDS) was estimated for all SNPs in individual genomic windows of 50 kb. The raw SDS (rSDS) results showed that the Gushi chicken was subjected to polygenic selection over the past ~100 years (Fig. 5a). In the high mSDS region on GGA2, we observed relatively low π and high iHS values. As iHS detects selection signals that occurred approximately 1,000 years ago while SDS captures recent selection signals within the last 100 to 200 generations, the pattern suggests that this region has been under selection over the last 100 years. Conversely, in the high mSDS region on GGA27, both iHS and π values were low, indicating that selection on this region took place only recently (Supplementary Fig. S12).

We polarized the sign of the rSDS according to the GWAS result for each trait, so a positive trait-SDS (tSDS) indicates an increase in the frequency of the allele favoring the trait. For polygenic traits, SNPs that do not reach genome-wide significance in GWAS also contribute to phenotypic variation. Therefore, in the tSDS analysis, we considered all SNPs rather than only the significant SNPs in GWAS. For each trait, the Spearman correlation between −log₁₀(P) from SNP-based GWAS and tSDS values was computed (Fig. 5b, Supplementary Table S8). Theoretically, if a trait is favored by polygenic selection, a positive coefficient is expected; otherwise, the correlation would be negative. We found that the Gushi chicken was subjected to different polygenic selection at different stages: the EV trait experienced positive selection during the up-stage (Fig. 5c), while it was selected against during the sustained- and all-stages (Fig. 5b). We hypothesized that during the up-stage, artificial selection favors higher egg production variance to facilitate accelerated productivity growth, whereas in the sustained-stage, lower egg production variance is favored to sustain prolonged egg production at a satisfactory level. Moreover, frequent alternations between clutches and intervals may interrupt egg-laying continuity, which would explain why the CPN (clutch period number) trait was selected against, whereas the ECI trait was favored by selection during the sustained-stage (Fig. 5b). These pieces of evidence collectively indicate refined polygenic selection for enhanced egg production efficiency.

Using a similar approach to correlate HGWAS summary statistics with SDS, we found that SNPs corresponding to BHA had a positive mSDS, whereas SNPs corresponding to UHA had a negative mSDS. We found that polygenic selection was even more pronounced at the haplotype level: the average tSDS for the up-ECI HGWAS significant block was 0.160, much higher than that for SNPs (0.026). This suggests that the polygenic selection on egg production efficiency is better captured using haplotypes.

Dual role of CNNM2 on egg production variance to optimize egg production efficiency

The selective favoring of loci that promote multiple egg-laying traits related to egg production efficiency suggests that certain genes or loci may have pleiotropic effects on egg production efficiency. Canonical correlation analysis (CCA)-based GWAS was employed to test the correlation between each SNP and multiple traits simultaneously in the up-stage and sustained-stage. After a 5% false discovery rate (FDR) correction, 22 significant SNPs were identified during the up-stage, of which 7 were located between 23.2 Mb and 25.0 Mb on GGA6 (Fig. 6a, Supplementary Table S9). A total of 166 significant SNPs were detected during the sustained-stage, of which 133 were located between 23.3 Mb and 28.9 Mb on GGA6 (Fig. 6b, Supplementary Table S10). The significant signals on GGA6 in both the up-stage and sustained-stage were mapped to the same gene, CNNM2, which spans approximately 100 kb and harbors three associated SNPs in its intron 1 during the up-stage and two associated SNPs in its introns 2 and 4 during the sustained-stage. Although these five significant SNPs did not physically overlap (being about 20 kb apart), they were in strong LD (Fig. 6c).
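The text states only that a 5% FDR correction was applied and does not name the procedure. As an illustration, the sketch below uses the Benjamini-Hochberg step-up procedure, which is the most common choice; treating it as the method used here is an assumption, and the function name and toy P-values are hypothetical.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Benjamini-Hochberg step-up procedure.

    Returns a list of booleans, True where the test is declared significant
    at false discovery rate alpha.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff_rank = 0
    # Find the largest rank k whose sorted P-value satisfies p_(k) <= alpha * k / m.
    for rank, idx in enumerate(order, start=1):
        if pvalues[idx] <= alpha * rank / m:
            cutoff_rank = rank
    rejected = [False] * m
    # Every test ranked at or below that cutoff is rejected.
    for idx in order[:cutoff_rank]:
        rejected[idx] = True
    return rejected

# Hypothetical P-values for eight SNPs.
toy_p = [0.001, 0.008, 0.039, 0.041, 0.042, 0.06, 0.074, 0.205]
significant = benjamini_hochberg(toy_p)
```

The step-up rule rejects all tests up to the largest qualifying rank, even if an earlier sorted P-value misses its own threshold, which makes it less conservative than a Bonferroni cut-off when many SNPs in one region carry signal.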

For each of the five SNPs, the canonical coefficients between the SNP genotypes and each trait were calculated to reflect the relative contribution of the SNP effects on the individual trait (Fig. 6d). The three up-stage associated SNPs in intron 1 of CNNM2 were positively correlated with production variance (EV, WEV, BWEV) during the up-stage, while EV and WEV showed negative correlations with the same genotypes during the sustained-stage. As an example, the CC genotype of rs734813104 increases the EV during the up-stage but decreases it during the sustained-stage, indicating a dual role of this genotype during the two stages. On the other hand, the two sustained-stage associated SNPs in introns 2 and 4 are negatively correlated with production variance, contributing to stability during the sustained-stage (Fig. 6d). Considering the strength and direction of the effects of the three up-stage associated SNPs, the five SNPs together constitute a dual role of the CNNM2 gene on egg production variance during the two stages, which aligns with the pattern of polygenic selection observed.

To further elucidate the stage-dependent genetic mechanisms underlying egg production, we employed an alternative approach, constructing linear models to identify genotypes whose effects on the EV trait interact with laying stage. The three up-stage associated SNPs identified through CCA-based GWAS showed significant genotype-by-stage interactions (P = 0.008, 0.008, 0.005). Post-hoc analysis revealed that during the up-stage, the CC genotype of rs734813104 was significantly associated with increased EV (effect size = 0.94, P = 0.01), while the TT genotype was significantly associated with decreased EV (effect size = −1.59, P = 0.03), supporting the results from the canonical coefficients.

Altogether, CNNM2 exhibits a dual role at both the SNP and gene levels on egg production variance during the up-stage and sustained-stage of the egg-laying cycle to optimize egg production efficiency.

CNNM2 is functionally important in egg-laying

In this study, a large number of candidate genes were identified through different mapping approaches. Haplotype-based GWAS identified the highest number of candidate genes (542), followed by CCA-based GWAS (65), and SNP-based GWAS identified the fewest (9). Two candidate genes, LTBP2 and SYNE3, were found in both haplotype-based and SNP-based GWAS, and three candidate genes, EYA1, BTRC, and PNPLA7, were found in both haplotype-based and CCA-based GWAS (Fig. 7a). The candidate genes reported in this study partially overlapped with the reproduction-related genes in the chicken QTLdb (Fig. 7a). Moreover, the genes identified through SNP-based and haplotype-based GWAS showed substantial overlap with those from our previous study, despite the derived egg-laying traits not being exactly the same (Supplementary Fig. S13). Notably, when applying the same significance threshold as in our earlier work, we successfully replicated significant associations with genes such as TFPI2, CAMK2D, OSTN and APOA4 (Supplementary Table S11). GO enrichment analysis of all candidate genes using Metascape (Fig. 7c and d) revealed that the enriched terms included heart development, vascular development, neuromodulation, embryonic development, and cell adhesion. Notably, these functional categories aligned with the known roles of the HPO axis [44].

The candidate genes were further prioritized to determine their biological relevance, using known genes associated with human reproduction traits in the GWAS catalog as the reference set. A total of 419 candidate genes were successfully prioritized. The top 20 prioritized genes (Supplementary Table S12), along with CNNM2, were then selected to construct an integrative biological network (Fig. 7b). Notably, CNNM2 occupies a central position in this network by interacting with other genes through three hub genes: SLIT3, LAMA2, and SMOC2. We then checked the expression of CNNM2 across multiple tissues. Mutations located within introns can affect splicing and gene expression. Since the Illumina sequencing platform we used is not well-suited for accurately detecting alternative splicing, we instead investigated the correlation between the genotypes of the five associated intronic SNPs, as well as SNPs in high LD with them, and the expression of CNNM2. Unfortunately, we were unable to find any significant correlation. We did find that CNNM2 was highly expressed in the ovary (Fig. 7e). By comparing the expression of CNNM2 between high-yield and low-yield chickens, we observed significantly lower expression of CNNM2 in the hypothalamus of high-yield chickens. This finding confirms that CNNM2 plays a role in egg-laying in chickens (Fig. 7e).

Discussion

GWAS has become a standard approach for genetic mapping. In this study, many candidate genes were identified through GWAS on the up-ECI and sustained-TILI traits. The annotated genes contribute directly or indirectly to egg production. Among them, SHROOM2 is highly expressed in the endothelium of the developing vasculature, playing a crucial role in the initial formation and subsequent remodeling of the vascular network [45, 46]. SYNE3, on the other hand, is essential for perinuclear cytoskeletal organization and the attachment of the centrosome to the nuclear envelope [47]. It forms spermiogenesis-specific LINC complexes with Sun1η [48], suggesting its involvement in sperm development. CYGB, a cytoglobin, may be involved in collagen synthesis or in the function of O₂-consuming enzymes, facilitating O₂ diffusion to the respiratory chain [49, 50]. R3HDM1 affects the formation of neural networks in the brain, modulating the growth and branching of dendrites in normal mouse neurons [51]. Additionally, R3HDM1 has been identified as one of the up-regulated genes in the pregnant endometrium, suggesting its involvement in reproductive regulation [52]. Some associated genes seem to have no direct relation to egg-laying, which may reflect our still incomplete understanding of reproductive processes. For example, ZRANB3 is typically involved in maintaining genomic stability during DNA replication [53], and LTBP2 plays a positive role in lung elastinogenesis [54]. In summary, these genes correspond to various functions, suggesting complex molecular mechanisms underlying egg production.

While GWAS has been highly successful, SNP-based GWAS has known limitations in fully uncovering the genetic basis of complex traits [55]. Haplotype-based GWAS aggregates the effects of multiple linked loci, potentially combining additive, dominant, and short-range epistatic effects. Studies have shown that haplotype-based GWAS is superior to SNP-based GWAS in terms of statistical power, allelic effect estimation, and avoidance of false positives [56,57,58]. In our study, haplotype-based GWAS detected association signals for egg-laying traits that were undetected in SNP-based GWAS. In addition, in terms of both the total number of loci influencing the trait and the distribution of their effect sizes, the haplotype-based analysis clearly demonstrated the polygenicity of egg-laying traits.

Although selection signals are often considered to be significant changes in allele frequencies at high-impact loci, most alleles underlying polygenic traits have small effects, and phenotypic improvement can be achieved by the accumulation of these polygenic effects [59]. Without the need for fixation or dramatic changes in allele frequencies, populations can also shift allele frequencies at multiple loci to achieve optimal fitness [60, 61]. From this viewpoint, polygenic traits can respond rapidly to selection based on existing standing variation [62, 63]. The chickens were subjected to polygenic selection for increased egg production variance during the up-stage, which covers the period from the start of egg-laying to peak production. Larger variance implies greater potential and a faster increase in productivity; that is, individuals with large egg production variance are favored in the early stages of egg-laying, possibly due to selection for achieving rapid productive output. Production stability is more important during the sustained-stage, so egg production variance is selected against during this stage. Stable long-term egg production aligns with the breeding goal of modern chickens [64], so individuals with smaller egg production variance are favored to maintain a longer egg-laying cycle and higher egg production. The five associated SNPs in the CNNM2 gene exhibited characteristics consistent with this pattern. The three up-stage associated SNPs in CNNM2 promote production variance, facilitating more rapid gains in production capacity from the start of egg-laying in the chicken population. Conversely, the two sustained-stage associated SNPs in CNNM2 decrease variance during this period, resulting in a smooth decline in the laying curve and maintaining higher stability and overall efficiency in egg production.

From a functional perspective, CNNM2 encodes a protein that plays an important role in magnesium homeostasis. It is widely recognized that magnesium plays a crucial role in neural conduction and neuronal signaling. Mutations in CNNM2 are associated with hypomagnesemia, seizures, and impaired intellectual development [65]. In addition to genetic mechanisms, we explored the relationship between gene expression and egg-laying performance. As the expression of CNNM2 in the hypothalamus is significantly lower in high-yield chickens, we propose two hypotheses: (1) decreased expression of CNNM2 may reduce sensitivity to external stimuli by impairing cognitive function [66,67,68,69,70,71], or (2) reduced CNNM2 expression may redirect energy allocation from growth to reproduction, thereby enhancing egg production [72, 73].

We identified several genes that preferentially interact with CNNM2. Among them, LEPR encodes the receptor of leptin. While leptin is primarily known as an appetite regulator, it contributes to the initiation and maintenance of reproductive activity by promoting gonadotropin secretion through the hypothalamus [74,75,76,77]. It has been reported that leptin inhibits follicular development by directly antagonizing ovarian estradiol and progesterone secretion stimulated by follicle-stimulating hormone (FSH) or insulin-like growth factor I (IGF-I) [78,79,80]. ITGB1 plays a critical role in the migration of primordial germ cells to embryonic gonads in mice [81]. In spermatogenesis, both ITGA6 and ITGB1 serve as surface markers for mouse lymphocytes and spermatogonia [82]. SLIT3 may play an inhibitory role in the proliferation, differentiation, and follicular selection of granulosa cells in the prehierarchical follicles in the ovary [83]. LAMA2 is present in neurons of the brain and regulates synaptic function and plasticity in the central nervous system [84]. SMOC2 encodes a secreted protein that exhibits a broad tissue distribution in embryonic and adult mice [85]. It has been reported that SMOC2 participates in the regulation of cell proliferation and angiogenesis [86]. Taken together, our findings not only confirmed the polygenicity of egg-laying traits but also identified several significant GWAS signals associated with egg production efficiency and stability. We found that the combination of the two contributes a substantial proportion of the phenotypic variation. Our results agree with the refined omnigenic model of egg production, which emphasizes the importance of core genes with large effects as well as the cumulative small effects across peripheral genes [87, 88].

Conclusions

This study makes three key contributions to advancing our understanding of the genetic mechanisms underlying egg production. Firstly, we demonstrated the advantages of haplotypes in genetic mapping, elucidating polygenic architecture, tracing ancestral origins, detecting polygenic selection and predicting phenotypes. Secondly, by utilizing traits generated to characterize egg production efficiency and stability, we identified key genes that affect these traits. Thirdly, by investigating polygenic selection signals of complex traits and applying multiple association analyses, we uncovered the genetic basis of egg production efficiency. Our results provide valuable insights into enhancing egg production in chickens and position chickens as a model for studying the genetics of reproductive efficiency in other species.

Data Availability

The resequenced raw data from 888 Gushi hens were deposited in the Genome Sequence Archive (GSA) with accession number PRJCA021392. The RNA-seq raw data of 5 tissue types were deposited in the NCBI Sequence Read Archive with accession numbers PRJNA893445 and PRJNA953784. All other data supporting the findings of this study are available within the article and its Supplementary Information files. Source data are provided with this paper.

Abbreviations

AILI: Average inter-laying interval
BHA: Beneficial haplotype alleles
BJY: Beijing-You chicken
BLUP: Best linear unbiased prediction
BWEV: Bi-weekly egg production variance
BWMLR: Bi-weekly maximum laying rate
CCA: Canonical correlation analysis
CPN: Clutch period number
CV: Cross-validation
DAF: Derived allele frequency
DL: Dulong chicken
ECI: Effective clutch intensity
EV: Egg production variance
FDR: False discovery rate
GLM: General linear models
GRM: Genomic relationship matrix
GWAS: Genome-wide association study
HBP: Huaibei Partridge chicken
HGWAS: Genome-wide haplotype GWAS
HPO: Hypothalamic-pituitary-ovary
HPPS: Haplotype-based polygenic prediction score
iHS: Integrated haplotype score
II: Interclutch interval
LIT: Laying interval time
MILI: Maximum inter-laying interval
NDY: Ningdu Yellow chicken
PCs: Principal components
PVE: Proportion of variance explained
QTL: Quantitative trait loci
RJF: Red Jungle Fowl
SDS: Singleton density score
TCS: Total clutch size
TILI: Total inter-laying interval
UHA: Unbeneficial haplotype alleles
WEV: Weekly egg production variance
WMLR: Weekly maximum laying rate

References

1. van der Klein SA, Zuidhof MJ, Bédécarrats GY. Diurnal and seasonal dynamics affecting egg production in meat chickens: A review of mechanisms associated with reproductive dysregulation. Anim Reprod Sci. 2020:106257. https://doi.org/10.1016/j.anireprosci.2019.106257.
2. Fu M, Wu Y, Shen J, Pan A, Zhang H, Sun J, et al. Genome-wide association study of egg production traits in shuanglian chickens using whole genome sequencing. Genes. 2023;14:2129. https://doi.org/10.3390/genes14122129.
3. Chen A, Zhao X, Wen J, Zhao X, Wang G, Zhang X, et al. Genetic parameter estimation and molecular foundation of chicken egg-laying trait. Poult Sci. 2024;103:103627. https://doi.org/10.1016/j.psj.2024.103627.
4. Wolc A, Jankowski T, Arango J, Settar P, Fulton J, O’Sullivan N, et al. Investigating the genetic determination of clutch traits in laying hens. Poult Sci. 2019;98:39. https://doi.org/10.3382/ps/pey354.
5. Wang J, Liu Z, Cao D, Liu J, Li F, Han H, et al. Elucidation of the genetic determination of clutch traits in Chinese local chickens of the Laiwu Black breed. BMC Genomics. 2023;24:686. https://doi.org/10.1186/s12864-023-09798-0.
6. Du Y, Liu L, He Y, Dou T, Jia J, Ge C. Endocrine and genetic factors affecting egg laying performance in chickens: A review. Br Poult Sci. 2020:538. https://doi.org/10.1080/00071668.2020.1758299.
7. Wang D, Tan L, Zhi Y, Bu L, Wang Y, Wang Z, et al. Genome-wide variation study and inter-tissue communication analysis unveil regulatory mechanisms of egg-laying performance in chickens. Nat Commun. 2024;15:7069. https://doi.org/10.1038/s41467-024-50809-9.
8. Narinc D, Uckardes F, Aslan E. Egg production curve analyses in poultry science. World Poultry Sci J. 2014;70:817. https://doi.org/10.1017/S0043933914000877.
9. Liu Z, Yang N, Yan Y, Li G, Liu A, Wu G, et al. Genome-wide association analysis of egg production performance in chickens across the whole laying period. BMC Genet. 2019;20:67. https://doi.org/10.1186/s12863-019-0771-7.
10. Sella G, Barton NH. Thinking about the evolution of complex traits in the era of genome-wide association studies. Annu Rev Genomics Hum Genet. 2019;20:461. https://doi.org/10.1146/annurev-genom-083115-022316.
11. Sun C, Qu L, Yi G, Yuan J, Duan Z, Shen M, et al. Genome-wide association study revealed a promising region and candidate genes for eggshell quality in an F2 resource population. BMC Genomics. 2015;16:565. https://doi.org/10.1186/s12864-015-1795-7.
12. Chen S. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. iMeta. 2023;2. https://doi.org/10.1002/imt2.107.
13. Bu L, Wang Q, Gu W, Yang R, Zhu D, Song Z, et al. Improving read alignment through the generation of alternative reference via iterative strategy. Sci Rep. 2020;10:18712. https://doi.org/10.1038/s41598-020-74526-7.
14. Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10. https://doi.org/10.1093/gigascience/giab008.
15. Yang R, Guo X, Zhu D, Tan C, Bian C, Ren J, et al. Accelerated deciphering of the genetic architecture of agricultural economic traits in pigs using a low-coverage whole-genome sequencing strategy. Gigascience. 2021;10. https://doi.org/10.1093/gigascience/giab048.
16. Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience. 2015.
17. Browning SR, Browning BL. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet. 2007;81:1084. https://doi.org/10.1086/521987.
18. Wood PDP. Algebraic model of the lactation curve in cattle. Nature. 1967;216:164. https://doi.org/10.1038/216164a0.
19. McMillan I, Fitz-Earle M, Robson D. Quantitative genetics of fertility I. Lifetime egg production of Drosophila melanogaster—theoretical. Genetics. 1970;65:349.
20. Yang N, Wu C, McMillan I. New mathematical model of poultry egg production. Poult Sci. 1989;68:476. https://doi.org/10.3382/ps.0680476.
21. Zakaria A, Miyaki T, Imai K. The relationships of clutch length and egg position on ovarian follicular growth in laying hens. Poult Sci. 1984;63:1250. https://doi.org/10.3382/ps.0631250.
22. Jiang L, Zheng Z, Qi T, Kemper KE, Wray NR, Visscher PM, et al. A resource-efficient tool for mixed model association analysis of large-scale data. Nat Genet. 2019;51:1749. https://doi.org/10.1038/s41588-019-0530-8.
23. Yin L, Zhang H, Tang Z, Xu J, Yin D, Zhang Z, et al. rMVP: a memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study. Genomics Proteomics Bioinformatics. 2021:619. https://doi.org/10.1016/j.gpb.2020.10.007.
24. Dong SS, He WM, Ji JJ, Zhang C, Guo Y, Yang TL. LDBlockShow: a fast and convenient tool for visualizing linkage disequilibrium and haplotype blocks based on variant call format files. Brief Bioinform. 2021.
25. Ziyatdinov A, Vázquez-Santiago M, Brunel H, Martinez-Perez A, Aschard H, Soria JM. lme4qtl: linear mixed models with flexible covariance structure for genetic studies of related individuals. BMC Bioinformatics. 2018:68. https://doi.org/10.1186/s12859-018-2057-x.
26. Ferdosi MH, Henshall J, Tier B. Study of the optimum haplotype length to build genomic relationship matrices. Genet Sel Evol. 2016;48:75. https://doi.org/10.1186/s12711-016-0253-6.
27. Gu Z, Gong J, Zhu Z, Li Z, Feng Q, Wang C, et al. Structure and function of rice hybrid genomes reveal genetic basis and optimal performance of heterosis. Nat Genet. 2023;55:1745. https://doi.org/10.1038/s41588-023-01495-8.
28. Yin L, Zhang H, Tang Z, Yin D, Fu Y, Yuan X, et al. HIBLUP: an integration of statistical models on the BLUP framework for efficient genetic evaluation using big genomic data. Nucleic Acids Res. 2023:3501. https://doi.org/10.1093/nar/gkad074.
29. Santiago E, Novo I, Pardiñas AF, Saura M, Wang J, Caballero A. Recent demographic history inferred by high-resolution analysis of linkage disequilibrium. Mol Biol Evol. 2020;37:3642. https://doi.org/10.1093/molbev/msaa169.
30. Maclean CA, Chue Hong NP, Prendergast JGD. Hapbin: an efficient program for performing haplotype-based scans for positive selection in large genomic datasets. Mol Biol Evol. 2015:3027. https://doi.org/10.1093/molbev/msv172.
31. Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27:2156. https://doi.org/10.1093/bioinformatics/btr330.
32. Field Y, Boyle EA, Telis N, Gao Z, Gaulton KJ, Golan D, et al. Detection of human adaptation during the past 2000 years. Science. 2016;354:760. https://doi.org/10.1126/science.aag0776.
33. Luo H, Zhang P, Zhang W, Zheng Y, Hao D, Shi Y, et al. Recent positive selection signatures reveal phenotypic evolution in the Han Chinese population. Sci Bull (Beijing). 2023;68:2391–404. https://doi.org/10.1016/j.scib.2023.08.027.
34. Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019;37:907. https://doi.org/10.1038/s41587-019-0201-4.
35. Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33:290. https://doi.org/10.1038/nbt.3122.
36. Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10:1523. https://doi.org/10.1038/s41467-019-09234-6.
37. Chen J, Bardes EE, Aronow BJ, Jegga AG. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res. 2009;37. https://doi.org/10.1093/nar/gkp427.
38. Sollis E, Mosaku A, Abid A, Buniello A, Cerezo M, Gil L, et al. The NHGRI-EBI GWAS Catalog: knowledgebase and deposition resource. Nucleic Acids Res. 2023. https://doi.org/10.1093/nar/gkac1010.
39. Warde-Farley D, Donaldson SL, Comes O, Zuberi K, Badrawi R, Chao P, et al. The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res. 2010. https://doi.org/10.1093/nar/gkq537.
40. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003:2498. https://doi.org/10.1101/gr.1239303.
41. Bu L, Wang Y, Tan L, Wen Z, Hu X, Zhang Z, et al. Haplotype analysis incorporating ancestral origins identified novel genetic loci associated with chicken body weight using an advanced intercross line. Genet Sel Evol. 2024;56:78. https://doi.org/10.1186/s12711-024-00946-y.
42. Yang X, Tai Y, Ma Y, Xu Z, Hao J, Han D, et al. Cecum microbiome and metabolism characteristics of Silky Fowl and White Leghorn chicken in late laying stages. Front Microbiol. 2022;13:984654. https://doi.org/10.3389/fmicb.2022.984654.
43. Kanekiyo K, Inamori K-i, Kitazume S, Sato K, Maeda J, Higuchi M, et al. Loss of branched O-mannosyl glycans in astrocytes accelerates remyelination. J Neurosci. 2013;33:10037–47.
44. Derese DB, Lu L, Shi F. Major regulatory factors for reproductive performances of female chickens. Asian Pac J Reprod. 2024;13:197. https://doi.org/10.4103/apjr.apjr_62_24.
45. Farber MJ, Rizaldy R, Hildebrand JD. Shroom2 regulates contractility to control endothelial morphogenesis. Mol Biol Cell. 2011;22:795. https://doi.org/10.1091/mbc.E10-06-0505.
46. Dietz ML, Bernaciak TM, Vendetti F, Kielec JM, Hildebrand JD. Differential actin-dependent localization modulates the evolutionarily conserved activity of Shroom family proteins. J Biol Chem. 2006;281:20542. https://doi.org/10.1074/jbc.m512463200.
47. Morgan JT, Pfeiffer ER, Thirkill TL, Kumar P, Peng G, Fridolfsson HN, et al. Nesprin-3 regulates endothelial cell morphology, perinuclear cytoskeletal architecture, and flow-induced polarization. Mol Biol Cell. 2011;22:4324. https://doi.org/10.1091/mbc.e11-04-0287.
48. Geuss E, Schmitt J, Benavente R, Alsheimer M. Mammalian sperm head formation involves different polarization of two novel LINC complexes. PLoS ONE. 2010;5. https://doi.org/10.1371/journal.pone.0012072.
49. Hankeln T, Ebner B, Fuchs C, Gerlach F, Haberkamp M, Laufs TL, et al. Neuroglobin and cytoglobin in search of their role in the vertebrate globin family. J Inorg Biochem. 2005;99:110.
50. Trent JT, Hargrove MS. A ubiquitously expressed human hexacoordinate hemoglobin. J Biol Chem. 2002;277:19538. https://doi.org/10.1074/jbc.M201934200.
51. Fukushi D, Inaba M, Katoh K, Suzuki Y, Enokido Y, Nomura N, et al. R3HDM1 haploinsufficiency is associated with mild intellectual disability. Am J Med Genet A. 2021;185:1776. https://doi.org/10.1002/ajmg.a.62173.
52. Adhikari B, Lee CN, Khadka VS, Deng Y, Fukumoto G, Thorne M, et al. RNA-Sequencing based analysis of bovine endometrium during the maternal recognition of pregnancy. BMC Genomics. 2022;23:494. https://doi.org/10.1186/s12864-022-08720-4.
53. Ciccia A, Nimonkar AV, Hu Y, Hajdu I, Achar YJ, Izhar L, et al. Polyubiquitinated PCNA recruits the ZRANB3 translocase to maintain genomic integrity after replication stress. Mol Cell. 2012;47:396–409. https://doi.org/10.1016/j.molcel.2012.05.024.
54. Fujikawa Y, Yoshida H, Inoue T, Ohbayashi T, Noda K, Von Melchner H, et al. Latent TGF-β binding protein 2 and 4 have essential overlapping functions in microfibril development. Sci Rep. 2017;7:43714. https://doi.org/10.1038/srep43714.
55. Zhang H, Shen L-Y, Xu Z-C, Kramer LM, Yu J-Q, Zhang X-Y, et al. Haplotype-based genome-wide association studies for carcass and growth traits in chicken. Poult Sci. 2020;99:2349–61. https://doi.org/10.1016/j.psj.2020.01.009.
56. Sato S, Uemoto Y, Kikuchi T, Egawa S, Kohira K, Saito T, et al. SNP- and haplotype-based genome-wide association studies for growth, carcass, and meat quality traits in a Duroc multigenerational population. BMC Genet. 2016;17:60. https://doi.org/10.1186/s12863-016-0368-3.
57. Chen Z, Yao Y, Ma P, Wang Q, Pan Y. Haplotype-based genome-wide association study identifies loci and candidate genes for milk yield in Holsteins. PLoS ONE. 2018;13. https://doi.org/10.1371/journal.pone.0192695.
58. Bovo S, Ballan M, Schiavo G, Gallo M, Dall'Olio S, Fontanesi L. Haplotype-based genome-wide association studies reveal new loci for haematological and clinical-biochemical parameters in Large White pigs. Anim Genet. 2020;51:601–6.
59. Yeaman S. Evolution of polygenic traits under global vs local adaptation. Genetics. 2022;220. https://doi.org/10.1093/genetics/iyab134.
60. Pritchard JK, Di Rienzo A. Adaptation – not by sweeps alone. Nat Rev Genet. 2010;11:665. https://doi.org/10.1038/nrg2880.
61. Graves JL Jr, Hertweck KL, Phillips MA, Han MV, Cabral LG, Barter TT, et al. Genomics of parallel experimental evolution in Drosophila. Mol Biol Evol. 2017;34:831. https://doi.org/10.1093/molbev/msw282.
62. Teotónio H, Chelo IM, Bradić M, Rose MR, Long AD. Experimental evolution reveals natural selection on standing genetic variation. Nat Genet. 2009;41:251. https://doi.org/10.1038/ng.289.
63. Burke MK, Liti G, Long AD. Standing genetic variation drives repeatable experimental evolution in outcrossing populations of Saccharomyces cerevisiae. Mol Biol Evol. 2014;31(12):3228. https://doi.org/10.1093/molbev/msu256.
64. Bain MM, Nys Y, Dunn IC. Increasing persistency in lay and stabilising egg quality in longer laying cycles. What are the challenges? Br Poult Sci. 2016;57:330–8.
65. Franken GA, Seker M, Bos C, Siemons LA, van der Eerden BC, Christ A, et al. Cyclin M2 (CNNM2) knockout mice show mild hypomagnesaemia and developmental defects. Sci Rep. 2021;11:8217. https://doi.org/10.1038/s41598-021-87548-6.
66. Lindqvist C, Schütz K, Jensen P. Red jungle fowl have more contrafreeloading than white leghorn layers: Effect of food deprivation and consequences for information gain. Behaviour. 2002:1195. https://doi.org/10.1163/15685390260437335.
67. Lindqvist C, Janczak AM, Nätt D, Baranowska I, Lindqvist N, Wichman A, et al. Transmission of stress-induced learning impairment and associated brain gene expression from parents to offspring in chickens. PLoS ONE. 2007;2. https://doi.org/10.1371/journal.pone.0000364.
68. Kirkden RD, Lindqvist C, Jensen P. Effects of domestication on filial motivation and imprinting in chicks: comparison of red junglefowl and White Leghorns. Anim Behav. 2008:287. https://doi.org/10.1016/j.anbehav.2008.02.007.
69. Lindqvist C, Jensen P. Domestication and stress effects on contrafreeloading and spatial learning performance in red jungle fowl (Gallus gallus) and White Leghorn layers. Behav Processes. 2009;81:80–4. https://doi.org/10.1016/j.beproc.2009.02.005.
70. Gjøen J, Jean-Joseph H, Kotrschal K, Jensen P. Domestication and social environment modulate fear responses in young chickens. Behav Processes. 2023;210:104906. https://doi.org/10.1016/j.beproc.2023.104906.
71. Zhou D-Y, Su X, Wu Y, Yang Y, Zhang L, Cheng S, et al. Decreased CNNM2 expression in prefrontal cortex affects sensorimotor gating function, cognition, dendritic spine morphogenesis and risk of schizophrenia. Neuropsychopharmacology. 2024;49:433. https://doi.org/10.1038/s41386-023-01732-y.
72. Bayle D, Coudy-Gandilhon C, Gueugneau M, Castiglioni S, Zocchi M, Maj-Zurawska M, et al. Magnesium deficiency alters expression of genes critical for muscle magnesium homeostasis and physiology in mice. Nutrients. 2021;13:2169. https://doi.org/10.3390/nu13072169.
73. Stockebrand M, Sasani A, Das D, Hornig S, Hermans-Borgmeyer I, Lake HA, et al. A mouse model of creatine transporter deficiency reveals impaired motor function and muscle energy metabolism. Front Physiol. 2018;9:773. https://doi.org/10.3389/fphys.2018.00773.
74. Chehab FF, Lim ME, Lu R. Correction of the sterility defect in homozygous obese female mice by treatment with the human recombinant leptin. Nat Genet. 1996;12:318. https://doi.org/10.1038/ng0396-318.
75. Chehab FF, Mounzih K, Lu R, Lim ME. Early onset of reproductive function in normal female mice treated with leptin. Science. 1997;275:88. https://doi.org/10.1126/science.275.5296.88.
76. Cunningham MJ, Clifton DK, Steiner RA. Leptin’s actions on the reproductive axis: perspectives and mechanisms. Biol Reprod. 1999:216. https://doi.org/10.1095/biolreprod60.2.216.
77. Paczoska-Eliasiewicz H, Gertler A, Proszkowiec M, Proudman J, Hrabia A, Sechman A, et al. Attenuation by leptin of the effects of fasting on ovarian function in hens (Gallus domesticus). Reproduction. 2003;126:739. https://doi.org/10.1530/rep.0.1260739.
78. Zachow RJ, Magoffin DA. Direct intraovarian effects of leptin: impairment of the synergistic action of insulin-like growth factor-I on follicle-stimulating hormone-dependent estradiol-17β production by rat ovarian granulosa cells. Endocrinology. 1997:847. https://doi.org/10.1210/endo.138.2.5035.
79. Zachow RJ, Weitsman SR, Magoffin DA. Leptin impairs the synergistic stimulation by transforming growth factor-β of follicle-stimulating hormone-dependent aromatase activity and messenger ribonucleic acid expression in rat ovarian granulosa cells. Biol Reprod. 1999;61:1104. https://doi.org/10.1095/biolreprod61.4.1104.
80. Agarwal SK, Vogel K, Weitsman SR, Magoffin DA. Leptin antagonizes the insulin-like growth factor-I augmentation of steroidogenesis in granulosa and theca cells of the human ovary. J Clin Endocrinol Metab. 1999;84:1072. https://doi.org/10.1210/jcem.84.3.5543.
81. Anderson R, Fässler R, Georges-Labouesse E, Hynes RO, Bader BL, Kreidberg JA, et al. Mouse primordial germ cells lacking β1 integrins enter the germline but fail to migrate normally to the gonads. Development. 1999;126:1655. https://doi.org/10.1242/dev.126.8.1655.
82. Shinohara T, Avarbock MR, Brinster RL. β1- and α6-integrin are surface markers on mouse spermatogonial stem cells. Proc Natl Acad Sci U S A. 1999;96:5504. https://doi.org/10.1073/pnas.96.10.5504.
83. Xu R, Qin N, Xu X, Sun X, Chen X, Zhao J. Implication of SLIT3-ROBO1/ROBO2 in granulosa cell proliferation, differentiation and follicle selection in the prehierarchical follicles of hen ovary. Cell Biol Int. 2018;42:1643. https://doi.org/10.1002/cbin.11063.
84. Tian M, Hagg T, Denisova N, Knusel B, Engvall E, Jucker M. Laminin-α2 chain-like antigens in CNS dendritic spines. Brain Res. 1997;764:28. https://doi.org/10.1016/s0006-8993(97)00420-4.
85. Maier S, Paulsson M, Hartmann U. The widely expressed extracellular matrix protein SMOC-2 promotes keratinocyte attachment and migration. Exp Cell Res. 2008;314:2477–87. https://doi.org/10.1016/j.yexcr.2008.05.020.
86. Pazin DE, Albrecht KH. Developmental expression of Smoc1 and Smoc2 suggests potential roles in fetal gonad and reproductive tract differentiation. Dev Dyn. 2009;238:2877. https://doi.org/10.1002/dvdy.22124.
87. Zhang W, Reeves GR, Tautz D. Testing implications of the omnigenic model for the genetic analysis of loci identified through genome-wide association. Curr Biol. 2021;31:1092–8. https://doi.org/10.1016/j.cub.2020.12.023.
88. Boyle EA, Li YI, Pritchard JK. An expanded view of complex traits: from polygenic to omnigenic. Cell. 2017:1177.

Acknowledgements

We gratefully acknowledge the support of the Xihe high-performance computing platform of the National Research Facility for Phenotypic and Genotypic Analysis of Model Animals (Beijing), China Agricultural University. This work was supported by the National Key Research and Development Program of China (2022YFF1000204 and 2021YFD1200803).

Funding

This work was supported by the National Key Research and Development Program of China (2022YFF1000204 and 2021YFD1200803).

Ethics Declaration

Ethics approval and consent to participate

All animal experimental protocols were approved by the Institutional Animal Care and Use Committee of Henan Agricultural University (protocol number 11-0085). The methods were carried out in accordance with the approved guidelines.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Rights and Permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

[1] 1.van der Klein SA, Zuidhof MJ, Bédécarrats GY. Diurnal and seasonal dynamics affecting egg production in meat chickens.(2020)A review of mechanisms associated with reproductive dysregulation.Anim Reprod Sci.: 106257.
10.1016/j.anireprosci.2019.106257
Article|CAS|PubMed|Google Scholar

[2] 2.Fu M, Wu Y, Shen J, Pan A, Zhang H, Sun J, et al. Genome-wide association study of egg production traits in shuanglian chickens using whole genome sequencing. Genes. 2023;14.(2023)org/10.3390/genes14122129.: 2129.
10.3390/genes14122129
Article|CAS|PubMed|Google Scholar

[3] 3.Chen A, Zhao X, Wen J, Zhao X, Wang G, Zhang X, et al. Genetic parameter estimation and molecular foundation of chicken egg-laying trait. Poult Sci. 2024;103.(2024)103627. https://doi. org/10. 1016/j.psj.: 103627.
10.1016/j.psj.2024.103627
Article|CAS|PubMed|Google Scholar

[4] 4.Wolc A, Jankowski T, Arango J, Settar P, Fulton J, O’Sullivan N, et al. Investigating the genetic determination of clutch traits in laying hens. Poult Sci. 2019;98.(2019)org/10.3382/ps/pey354.: 39.
10.3382/ps/pey354
Article|CAS|PubMed|Google Scholar

[5] 5.Wang J, Liu Z, Cao D, Liu J, Li F, Han H, et al. Elucidation of the genetic determination of clutch traits in Chinese local chickens of the Laiwu Black breed. BMC Genomics. 2023;24.(2023)org/10.1186/s12864-023-09798-0.: 686.
10.1186/s12864-023-09798-0
Article|CAS|PubMed|Google Scholar

[6] 6.Du Y, Liu L, He Y, Dou T, Jia J, Ge C. Endocrine and genetic factors affecting egg laying performance in chickens.(2020)A review.Br Poult Sci.: 538.
10.1080/00071668.2020.1758299
Article|CAS|PubMed|Google Scholar

[7] 7.Wang D, Tan L, Zhi Y, Bu L, Wang Y, Wang Z, et al. Genome-wide variation study and inter-tissue communication analysis unveil regulatory mechanisms of egg-laying performance in chickens. Nat Commun. 2024;15.(2024)org/10.1038/s41467-024-50809-9.: 7069.
10.1038/s41467-024-50809-9
Article|CAS|PubMed|Google Scholar

[8] 8.Narinc D, Uckardes F, Aslan E. Egg production curve analyses in poultry science. World Poultry Sci J. 2014;70.(2014)org/10.1017/S0043933914000877.: 817.
10.1017/S0043933914000877
Article|CAS|PubMed|Google Scholar

[9] 9.Liu Z, Yang N, Yan Y, Li G, Liu A, Wu G, et al. Genome-wide association analysis of egg production performance in chickens across the whole laying period. BMC Genet. 2019;20.(2019)org/10.1186/s12863-019-0771-7.: 67.
10.1186/s12863-019-0771-7
Article|CAS|PubMed|Google Scholar

[10] 10.Sella G, Barton NH. Thinking about the evolution of complex traits in the era of genome-wide association studies. Annu Rev Genomics Hum Genet. 2019;20.(2019)org/10.1146/annurev-genom-083115-022316.: 461.
10.1146/annurev-genom-083115-022316
Article|CAS|PubMed|Google Scholar

[11] 11.Sun C, Qu L, Yi G, Yuan J, Duan Z, Shen M, et al. Genome-wide association study revealed a promising region and candidate genes for eggshell quality in an F2 resource population. BMC Genomics. 2015;16.(2015)org/10.1186/s12864-015-1795-7.: 565.
10.1186/s12864-015-1795-7
Article|CAS|PubMed|Google Scholar

[12] 12.Chen S. Ultrafast one-pass FASTQ data preprocessing, quality control, and deduplication using fastp. iMeta. 2023;2.(2023)1002/imt2.107.
Article|CAS|PubMed|Google Scholar

[13] 13.Bu L, Wang Q, Gu W, Yang R, Zhu D, Song Z, et al. Improving read alignment through the generation of alternative reference via iterative strategy. Sci Rep. 2020;10.(2020)org/10.1038/s41598-020-74526-7.: 18712.
10.1038/s41598-020-74526-7
Article|CAS|PubMed|Google Scholar

[14] 14.Danecek P, Bonfield JK, Liddle J, Marshall J, Ohan V, Pollard MO, et al. Twelve years of SAMtools and BCFtools. Gigascience. 2021;10.(2021)org/10.1093/gigascience/giab008.
Article|CAS|PubMed|Google Scholar

[15] 15.Yang R, Guo X, Zhu D, Tan C, Bian C, Ren J, et al. Accelerated deciphering of the genetic architecture of agricultural economic traits in pigs using a low-coverage whole-genome sequencing strategy. Gigascience. 2021;10.(2021)org/10.1093/gigascience/giab048.
Article|CAS|PubMed|Google Scholar

[16] 16.Chang CC, Chow CC, Tellier LCAM, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK.(2015)rising to the challenge of larger and richer datasets.Gigascience.
Article|CAS|PubMed|Google Scholar

[17] 17.Browning SR, Browning BL. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet. 2007;81.(2007)org/10.1086/521987.: 1084.
10.1086/521987
Article|CAS|PubMed|Google Scholar

[18] 18.Wood PDP. Algebraic model of the lactation curve in cattle. Nature. 1967;216.(1967)org/10.1038/216164a0.: 164.
Article|CAS|PubMed|Google Scholar

[19] 19.McMillan I, Fitz-Earle M, Robson D. Quantitative genetics of fertility I. Lifetime egg production of Drosophila melanogaster—theoretical. Genetics. 1970;65.(1970)2.349.: 349.
Article|CAS|PubMed|Google Scholar

[20] 20.Yang N, Wu C, McMillan IAN. New mathematical model of poultry egg production. Poult Sci. 1989;68.(1989)3382/ps.0680476.: 476.
Article|CAS|PubMed|Google Scholar

[21] 21.Zakaria A, Miyaki T, Imai K. The relationships of clutch length and egg position on ovarian follicular growth in laying hens. Poult Sci. 1984;63.(1984)3382/ps.0631250.: 1250.
10.3382/ps.0631250
Article|CAS|PubMed|Google Scholar

[22] 22.Jiang L, Zheng Z, Qi T, Kemper KE, Wray NR, Visscher PM, et al. A resource-efficient tool for mixed model association analysis of large-scale data. Nat Genet. 2019;51.(2019)org/10.1038/s41588-019-0530-8.: 1749.
10.1038/s41588-019-0530-8
Article|CAS|PubMed|Google Scholar

[23] 23.Yin L, Zhang H, Tang Z, Xu J, Yin D, Zhang Z, et al. rMVP.(2021)a memory-efficient, visualization-enhanced, and parallel-accelerated tool for genome-wide association study.Genomics Proteomics Bioinformatics.: 619.
10.1016/j.gpb.2020.10.007
Article|CAS|PubMed|Google Scholar

[24] 24.Dong SS, He WM, Ji JJ, Zhang C, Guo Y, Yang TL. LDBlockShow.(2021)a fast and convenient tool for visualizing linkage disequilibrium and haplotype blocks based on variant call format files.Brief Bioinform.
Article|CAS|PubMed|Google Scholar

[25] 25.Ziyatdinov A, Vázquez-Santiago M, Brunel H, Martinez-Perez A, Aschard H, Soria JM. lme4qtl.(2018)linear mixed models with flexible covariance structure for genetic studies of related individuals.BMC Bioinformatics.: 68.
10.1186/s12859-018-2057-x
Article|CAS|PubMed|Google Scholar

[26] 26.Ferdosi MH, Henshall J, Tier B. Study of the optimum haplotype length to build genomic relationship matrices. Genet Sel Evol. 2016;48.(2016)org/10.1186/s12711-016-0253-6.: 75.
10.1186/s12711-016-0253-6
Article|CAS|PubMed|Google Scholar

[27] 27.Gu Z, Gong J, Zhu Z, Li Z, Feng Q, Wang C, et al. Structure and function of rice hybrid genomes reveal genetic basis and optimal performance of heterosis. Nat Genet. 2023;55.(2023)org/10.1038/s41588-023-01495-8.: 1745.
10.1038/s41588-023-01495-8
Article|CAS|PubMed|Google Scholar

[28] 28.Yin L, Zhang H, Tang Z, Yin D, Fu Y, Yuan X, et al. HIBLUP.(2023)an integration of statistical models on the BLUP framework for efficient genetic evaluation using big genomic data.Nucleic Acids Res.: 3501.
10.1093/nar/gkad074
Article|CAS|PubMed|Google Scholar

[29] 29.Santiago E, Novo I, Pardiñas AF, Saura M, Wang J, Caballero A. Recent demographic history inferred by high-resolution analysis of linkage disequilibrium. Mol Biol Evol. 2020;37.(2020)org/10.1093/molbev/msaa169.: 3642.
10.1093/molbev/msaa169
Article|CAS|PubMed|Google Scholar

[30] 30.Maclean CA, Chue Hong NP, Prendergast JGD. Hapbin.(2015)an efficient program for performing haplotype-based scans for positive selection in large genomic datasets.Mol Biol Evol.: 3027.
10.1093/molbev/msv172
Article|CAS|PubMed|Google Scholar

[31] 31.Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27.(2011)org/10.1093/bioinformatics/btr330.: 2156.
10.1093/bioinformatics/btr330
Article|CAS|PubMed|Google Scholar

[32] 32.Field Y, Boyle EA, Telis N, Gao Z, Gaulton KJ, Golan D, et al. Detection of human adaptation during the past 2000 years. Science. 2016;354.(2000)1126/science.aag0776.: 760.
10.1126/science.aag0776
Article|CAS|PubMed|Google Scholar

[33] 33.Luo H, Zhang P, Zhang W, Zheng Y, Hao D, Shi Y, et al. Recent positive selection signatures reveal phenotypic evolution in the Han Chinese population. Sci Bull (Beijing). 2023;68.(2023)2391–404. https://doi. org/10. 1016/j.scib.: 2391.
10.1016/j.scib.2023.08.027
Article|CAS|PubMed|Google Scholar

[34] 34.Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019;37.(2019)org/10.1038/s41587-019-0201-4.: 907.
10.1038/s41587-019-0201-4
Article|CAS|PubMed|Google Scholar

[35] 35.Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33.(2015)1038/nbt.3122.: 290.
10.1038/nbt.3122
Article|CAS|PubMed|Google Scholar

[36] 36.Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10.(2019)org/10.1038/s41467-019-09234-6.: 1523.
10.1038/s41467-019-09234-6
Article|CAS|PubMed|Google Scholar

[37] 37.Chen J, Bardes EE, Aronow BJ, Jegga AG. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res. 2009;37.(2009)org/10.1093/nar/gkp427.
10.1093/nar/gkp427
Article|CAS|PubMed|Google Scholar

[38] 38.Sollis E, Mosaku A, Abid A, Buniello A, Cerezo M, Gil L, et al. The NHGRI-EBI GWAS Catalog.(2023)knowledgebase and deposition resource.Nucleic Acids Res.
10.1093/nar/gkac1010
Article|CAS|PubMed|Google Scholar

[39] 39.Warde-Farley D, Donaldson SL, Comes O, Zuberi K, Badrawi R, Chao P, et al. The GeneMANIA prediction server.(2010)biological network integration for gene prioritization and predicting gene function.Nucleic Acids Res.
10.1093/nar/gkq537
Article|CAS|PubMed|Google Scholar

[40] 40.Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape.(2003)a software environment for integrated models of biomolecular interaction networks.Genome Res.: 2498.
10.1101/gr.1239303
Article|CAS|PubMed|Google Scholar

[41] 41.Bu L, Wang Y, Tan L, Wen Z, Hu X, Zhang Z, et al. Haplotype analysis incorporating ancestral origins identified novel genetic loci associated with chicken body weight using an advanced intercross line. Genet Sel Evol. 2024;56.(2024)org/10.1186/s12711-024-00946-y.: 78.
10.1186/s12711-024-00946-y
Article|CAS|PubMed|Google Scholar

[42] 42.Yang X, Tai Y, Ma Y, Xu Z, Hao J, Han D, et al. Cecum microbiome and metabolism characteristics of Silky Fowl and White Leghorn chicken in late laying stages. Front Microbiol. 2022;13.(2022)984654. https://doi. org/10.3389/fmicb.: 984654.
10.3389/fmicb.2022.984654
Article|CAS|PubMed|Google Scholar

[43] 43.Kanekiyo K, Inamori K-i, Kitazume S, Sato K, Maeda J, Higuchi M, et al. Loss of branched O-mannosyl glycans in astrocytes accelerates remyelination. J Neurosci. 2013;33.(2013)10037–47. https://doi. org/10. 1523/JNEUROSCI.3137-12.: 10037.
Article|CAS|PubMed|Google Scholar

[44] 44.Derese DB, Lu L, Shi F. Major regulatory factors for reproductive performances of female chickens. Asian Pac J Reprod. 2024;13.(2024)4103/apjr.apjr_62_24.: 197.
10.4103/apjr.apjr_62_24
Article|CAS|PubMed|Google Scholar

[45] 45.Farber MJ, Rizaldy R, Hildebrand JD. Shroom2 regulates contractility to control endothelial morphogenesis. Mol Biol Cell. 2011;22.(2011)1091/mbc.E10-06-0505.: 795.
10.1091/mbc.E10-06-0505
Article|CAS|PubMed|Google Scholar

[46] 46.Dietz ML, Bernaciak TM, Vendetti F, Kielec JM, Hildebrand JD. Differential actin-dependent localization modulates the evolutionarily conserved activity of Shroom family proteins. J Biol Chem. 2006;281.(2006)1074/jbc.m512463200.: 20542.
10.1074/jbc.m512463200
Article|CAS|PubMed|Google Scholar

[47] 47.Morgan JT, Pfeiffer ER, Thirkill TL, Kumar P, Peng G, Fridolfsson HN, et al. Nesprin-3 regulates endothelial cell morphology, perinuclear cytoskeletal architecture, and flow-induced polarization. Mol Biol Cell. 2011;22.(2011)1091/mbc.e11-04-0287.: 4324.
10.1091/mbc.e11-04-0287
Article|CAS|PubMed|Google Scholar

[48] 48.Geuss E, Schmitt J, Benavente R, Alsheimer M. Mammalian sperm head formation involves different polarization of two novel LINC complexes. PLoS ONE. 2010;5.(2010)pone.0012072.
10.1371/journal.pone.0012072
Article|CAS|PubMed|Google Scholar

[49] 49.Hankeln T, Ebner B, Fuchs C, Gerlach F, Haberkamp M, Laufs TL, et al. Neuroglobin and cytoglobin in search of their role in the vertebrate globin family. J Inorg Biochem. 2005;99.(2005)11.009.: 110.
Article|CAS|PubMed|Google Scholar

[50] 50.Trent JT, Hargrove MS. A ubiquitously expressed human hexacoordinate hemoglobin. J Biol Chem. 2002;277.(2002)1074/jbc.M201934200.: 19538.
10.1074/jbc.M201934200
Article|CAS|PubMed|Google Scholar

[51] 51.Fukushi D, Inaba M, Katoh K, Suzuki Y, Enokido Y, Nomura N, et al. R3HDM1 haploinsufficiency is associated with mild intellectual disability. Am J Med Genet A. 2021;185.(2021)a.62173.: 1776.
10.1002/ajmg.a.62173
Article|CAS|PubMed|Google Scholar

[52] 52.Adhikari B, Lee CN, Khadka VS, Deng Y, Fukumoto G, Thorne M, et al. RNA-Sequencing based analysis of bovine endometrium during the maternal recognition of pregnancy. BMC Genomics. 2022;23.(2022)org/10.1186/s12864-022-08720-4.: 494.
10.1186/s12864-022-08720-4
Article|CAS|PubMed|Google Scholar

[53] 53.Ciccia A, Nimonkar AV, Hu Y, Hajdu I, Achar YJ, Izhar L, et al. Polyubiquitinated PCNA recruits the ZRANB3 translocase to maintain genomic integrity after replication stress. Mol Cell. 2012;47.(2012)396–409. https://doi. org/10. 1016/j.molcel.: 396.
10.1016/j.molcel.2012.05.024
Article|CAS|PubMed|Google Scholar

[54] 54.Fujikawa Y, Yoshida H, Inoue T, Ohbayashi T, Noda K, Von Melchner H, et al. Latent TGF-β binding protein 2 and 4 have essential overlapping functions in microfibril development. Sci Rep. 2017;7.(2017)org/10.1038/srep43714.: 43714.
10.1038/srep43714
Article|CAS|PubMed|Google Scholar

[55] 55.Zhang H, Shen L-Y, Xu Z-C, Kramer LM, Yu J-Q, Zhang X-Y, et al. Haplotype-based genome-wide association studies for carcass and growth traits in chicken. Poult Sci. 2020;99.(2020)2349–61. https://doi. org/10. 1016/j.psj.: 2349.
10.1016/j.psj.2020.01.009
Article|CAS|PubMed|Google Scholar

[56] 56.Sato S, Uemoto Y, Kikuchi T, Egawa S, Kohira K, Saito T, et al. SNP- and haplotype-based genome-wide association studies for growth, carcass, and meat quality traits in a Duroc multigenerational population. BMC Genet. 2016;17.(2016)org/10.1186/s12863-016-0368-3.: 60.
10.1186/s12863-016-0368-3
Article|CAS|PubMed|Google Scholar

[57] 57.Chen Z, Yao Y, Ma P, Wang Q, Pan Y. Haplotype-based genome-wide association study identifies loci and candidate genes for milk yield in Holsteins. PLoS ONE. 2018;13.(2018)pone.0192695.
10.1371/journal.pone.0192695
Article|CAS|PubMed|Google Scholar

[58] 58.Bovo S, Ballan M, Schiavo G, Gallo M, Dall'Olio S, Fontanesi L. Haplotype-based genome-wide association studies reveal new loci for haematological and clinical-biochemical parameters in Large White pigs. Anim Genet. 2020;51.(2020)601–6.Epub.: 601.
Article|CAS|PubMed|Google Scholar

[59] 59.Yeaman S. Evolution of polygenic traits under global vs local adaptation. Genetics. 2022;220.(2022)org/10.1093/genetics/iyab134.
Article|CAS|PubMed|Google Scholar

[60] 60.Pritchard JK, Di Rienzo A. Adaptation – not by sweeps alone. Nat Rev Genet. 2010;11.(2010)org/10.1038/nrg2880.: 665.
10.1038/nrg2880
Article|CAS|PubMed|Google Scholar

[61] 61.Graves JL Jr, Hertweck KL, Phillips MA, Han MV, Cabral LG, Barter TT, et al. Genomics of parallel experimental evolution in Drosophila. Mol Biol Evol. 2017;34.(2017)org/10.1093/molbev/msw282.: 831.
10.1093/molbev/msw282
Article|CAS|PubMed|Google Scholar

[62] 62.Teotónio H, Chelo IM, Bradić M, Rose MR, Long AD. Experimental evolution reveals natural selection on standing genetic variation. Nat Genet. 2009;41.(2009)1038/ng.289.: 251.
10.1038/ng.289
Article|CAS|PubMed|Google Scholar

[63] 63.Burke MK, Liti G, Long AD. Standing genetic variation drives repeatable experimental evolution in outcrossing populations of Saccharomyces cerevisiae. Mol Biol Evol. 2014;31(12).(2014)org/10.1093/molbev/msu256.: 3228.
10.1093/molbev/msu256
Article|CAS|PubMed|Google Scholar

[64] 64.Bain MM, Nys Y, Dunn IC. Increasing persistency in lay and stabilising egg quality in longer laying cycles. What are the challenges? Br Poult Sci. 2016;57.(2016)330–8. https://doi. org/10.1080/00071668.: 330.
Article|CAS|PubMed|Google Scholar

[65] 65.Franken GA, Seker M, Bos C, Siemons LA, van der Eerden BC, Christ A, et al. Cyclin M2 (CNNM2) knockout mice show mild hypomagnesaemia and developmental defects. Sci Rep. 2021;11.(2021)org/10.1038/s41598-021-87548-6.: 8217.
10.1038/s41598-021-87548-6
Article|CAS|PubMed|Google Scholar

[66] 66.Lindqvist C, Schütz K, Jensen P. Red jungle fowl have more contrafreeloading than white leghorn layers.(2002)Effect of food deprivation and consequences for information gain.Behaviour.: 1195.
10.1163/15685390260437335
Article|CAS|PubMed|Google Scholar

[67] 67.Lindqvist C, Janczak AM, Nätt D, Baranowska I, Lindqvist N, Wichman A, et al. Transmission of stress-induced learning impairment and associated brain gene expression from parents to offspring in chickens. PLoS ONE. 2007;2.(2007)pone.0000364.
10.1371/journal.pone.0000364
Article|CAS|PubMed|Google Scholar

[68] 68.Kirkden RD, Lindqvist C, Jensen P. Effects of domestication on filial motivation and imprinting in chicks.(2008)comparison of red junglefowl and White Leghorns.Anim Behav.: 287.
10.1016/j.anbehav.2008.02.007
Article|CAS|PubMed|Google Scholar

[69] 69.Lindqvist C, Jensen P. Domestication and stress effects on contrafreeloading and spatial learning performance in red jungle fowl (Gallus gallus) and White Leghorn layers. Behav Processes. 2009;81.(2009)80–4. https://doi. org/10. 1016/j.beproc.: 80.
10.1016/j.beproc.2009.02.005
Article|CAS|PubMed|Google Scholar

[70] 70.Gjøen J, Jean-Joseph H, Kotrschal K, Jensen P. Domestication and social environment modulate fear responses in young chickens. Behav Processes. 2023;210.(2023)104906. https://doi. org/10. 1016/j.beproc.: 104906.
10.1016/j.beproc.2023.104906
Article|CAS|PubMed|Google Scholar

[71] 71.Zhou D-Y, Su X, Wu Y, Yang Y, Zhang L, Cheng S, et al. Decreased CNNM2 expression in prefrontal cortex affects sensorimotor gating function, cognition, dendritic spine morphogenesis and risk of schizophrenia. Neuropsychopharmacology. 2024;49.(2024)org/10.1038/s41386-023-01732-y.: 433.
10.1038/s41386-023-01732-y
Article|CAS|PubMed|Google Scholar

[72] 72.Bayle D, Coudy-Gandilhon C, Gueugneau M, Castiglioni S, Zocchi M, Maj-Zurawska M, et al. Magnesium deficiency alters expression of genes critical for muscle magnesium homeostasis and physiology in mice. Nutrients. 2021;13.(2021)org/10.3390/nu13072169.: 2169.
10.3390/nu13072169
Article|CAS|PubMed|Google Scholar

[73] 73.Stockebrand M, Sasani A, Das D, Hornig S, Hermans-Borgmeyer I, Lake HA, et al. A mouse model of creatine transporter deficiency reveals impaired motor function and muscle energy metabolism. Front Physiol. 2018;9.(2018)773. https://doi. org/10.3389/fphys.: 773.
10.3389/fphys.2018.00773
Article|CAS|PubMed|Google Scholar

[74] 74.Chehab FF, Lim ME, Lu R. Correction of the sterility defect in homozygous obese female mice by treatment with the human recombinant leptin. Nat Genet. 1996;12.(1996)org/10.1038/ng0396-318.: 318.
10.1038/ng0396-318
Article|CAS|PubMed|Google Scholar

[75] 75.Chehab FF, Mounzih K, Lu R, Lim ME. Early onset of reproductive function in normal female mice treated with leptin. Science. 1997;275.(1997)5296.88.: 88.
10.1126/science.275.5296.88
Article|CAS|PubMed|Google Scholar

[76] 76.Cunningham MJ, Clifton DK, Steiner RA. Leptin’s actions on the reproductive axis.(1999)perspectives and mechanisms.Biol Reprod.: 216.
10.1095/biolreprod60.2.216
Article|CAS|PubMed|Google Scholar

[77] 77.Paczoska-Eliasiewicz H, Gertler A, Proszkowiec M, Proudman J, Hrabia A, Sechman A, et al. Attenuation by leptin of the effects of fasting on ovarian function in hens (Gallus domesticus). Reproduction. 2003;126.(2003)0.1260739.: 739.
10.1530/rep.0.1260739
Article|CAS|PubMed|Google Scholar

[78] 78.Zachow RJ, Magoffin DA. Direct intraovarian effects of leptin.(1997)impairment of the synergistic action of insulin-like growth factor-i on follicle-stimulating hormone-dependent estradiol-17β production by rat ovarian granulosa cells.Endocrinology.: 847.
10.1210/endo.138.2.5035
Article|CAS|PubMed|Google Scholar

[79] 79.Zachow RJ, Weitsman SR, Magoffin DA. Leptin impairs the synergistic stimulation by transforming growth factor-β of follicle-stimulating hormone-dependent aromatase activity and messenger ribonucleic acid expression in rat ovarian granulosa cells. Biol Reprod. 1999;61.(1999)4.1104.: 1104.
10.1095/biolreprod61.4.1104
Article|CAS|PubMed|Google Scholar

[80] 80.Agarwal SK, Vogel K, Weitsman SR, Magoffin DA. Leptin antagonizes the insulin-like growth factor-I augmentation of steroidogenesis in granulosa and theca cells of the human ovary. J Clin Endocrinol Metab. 1999;84.(1999)3.5543.: 1072.
10.1210/jcem.84.3.5543
Article|CAS|PubMed|Google Scholar

[81] 81.Anderson R, Fässler R, Georges-Labouesse E, Hynes RO, Bader BL, Kreidberg JA, et al. Mouse primordial germ cells lacking β1 integrins enter the germline but fail to migrate normally to the gonads. Development. 1999;126.(1999)8.1655.: 1655.
10.1242/dev.126.8.1655
Article|CAS|PubMed|Google Scholar

[82] 82.Shinohara T, Avarbock MR, Brinster RL. β1- and α6-integrin are surface markers on mouse spermatogonial stem cells. Proc Natl Acad Sci U S A. 1999;96.(1999)10.5504.: 5504.
10.1073/pnas.96.10.5504
Article|CAS|PubMed|Google Scholar

[83] 83.Xu R, Qin N, Xu X, Sun X, Chen X, Zhao J. Implication of SLIT3-ROBO1/ROBO2 in granulosa cell proliferation, differentiation and follicle selection in the prehierarchical follicles of hen ovary. Cell Biol Int. 2018;42.(2018)1002/cbin.11063.: 1643.
10.1002/cbin.11063
Article|CAS|PubMed|Google Scholar

[84] 84.Tian M, Hagg T, Denisova N, Knusel B, Engvall E, Jucker M. Laminin-α2 chain-like antigens in CNS dendritic spines. Brain Res. 1997;764.(1997)org/10.1016/s0006-8993(97)00420-4.: 28.
10.1016/s0006-8993(97)00420-4
Article|CAS|PubMed|Google Scholar

[85] 85.Maier S, Paulsson M, Hartmann U. The widely expressed extracellular matrix protein SMOC-2 promotes keratinocyte attachment and migration. Exp Cell Res. 2008;314.(2008)2477–87. https://doi. org/10. 1016/j.yexcr.: 2477.
10.1016/j.yexcr.2008.05.020
Article|CAS|PubMed|Google Scholar

[86] 86.Pazin DE, Albrecht KH. Developmental expression of Smoc1 and Smoc2 suggests potential roles in fetal gonad and reproductive tract differentiation. Dev Dyn. 2009;238.(2009)1002/dvdy.22124.: 2877.
10.1002/dvdy.22124
Article|CAS|PubMed|Google Scholar

[87] 87.Zhang W, Reeves GR, Tautz D. Testing implications of the omnigenic model for the genetic analysis of loci identified through genome-wide association. Curr Biol. 2021;31(1092–8).(2021)12.023.
10.1016/j.cub.2020.12.023
Article|CAS|PubMed|Google Scholar

[88] 88.Boyle EA, Li YI, Pritchard JK. An expanded view of complex traits.(2017)from polygenic to omnigenic.Cell.: 1177.
Article|CAS|PubMed|Google Scholar

Journal of Animal Science and Biotechnology
