The research methods of protein structure prediction mainly focus on finding effective features of protein sequences and developing suitable machine learning algorithms. But few people consider the importance of weigh...The research methods of protein structure prediction mainly focus on finding effective features of protein sequences and developing suitable machine learning algorithms. But few people consider the importance of weights of features in classification. We propose the GASVM algorithm (classification accuracy of support vector machine is regarded as the fitness value of genetic algorithm) to optimize the coefficients of these 16 features (5 features are proposed first time) in the classification, and further develop a new feature vector. Finally, based on the new feature vector, this paper uses support vector machine and 10-fold cross-validation to classify the protein structure of 3 low similarity datasets (25PDB, 1189, FC699). Experimental results show that the overall classification accuracy of the new method is better than other methods.展开更多
It has been reported that fresh edible rice has more bioactive compounds and its protein is easier to digest and has lower hypoallergenic than mature rice. In this paper, the changes in structure and functional proper...It has been reported that fresh edible rice has more bioactive compounds and its protein is easier to digest and has lower hypoallergenic than mature rice. In this paper, the changes in structure and functional properties of proteins at five different stages, including early milky stage(EMS), middle milky stage(MMS), late milky stage(LMS), waxy ripe stage(WS)and ripening stage(RS), during the seed development were investigated. It was found that with the seed developing, the molecular weight of fresh rice protein gradually become larger while the secondary structure changed from the highest content of disordered structure at MMS to the highest content of ordered structure at RS, which affect the surface hydrophobicity and then the functional properties of proteins, including foaming properties, emulsifying properties and oil holding capacity. Fresh rice protein at MMS has the strongest surface hydrophobicity while fresh edible rice protein at RS has the strongest oil holding capability. The results of our study can provide a theoretical basis for the application of fresh rice protein in the food industry and help to develop new fresh edible rice food.展开更多
Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields,including biochemistry,medicine,physics,mathematics,and computer science.These researchers adopt ...Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields,including biochemistry,medicine,physics,mathematics,and computer science.These researchers adopt various research paradigms to attack the same structure prediction problem:biochemists and physicists attempt to reveal the principles governing protein folding;mathematicians,especially statisticians,usually start from assuming a probability distribution of protein structures given a target sequence and then find the most likely structure,while computer scientists formulate protein structure prediction as an optimization problem-finding the structural conformation with the lowest energy or minimizing the difference between predicted structure and native structure.These research paradigms fall into the two statistical modeling cultures proposed by Leo Breiman,namely,data modeling and algorithmic modeling.Recently,we have also witnessed the great success of deep learning in protein structure prediction.In this review,we present a survey of the efforts for protein structure prediction.We compare the research paradigms adopted by researchers from different fields,with an emphasis on the shift of research paradigms in the era of deep learning.In short,the algorithmic modeling techniques,especially deep neural networks,have considerably improved the accuracy of protein structure prediction;however,theories interpreting the neural networks and knowledge on protein folding are still highly desired.展开更多
Food allergens are mainly naturally-occurring proteins with immunoglobulin E(IgE)-binding epitopes.Understanding the structural and immunogenic characteristics of allergenic proteins is essential in assessing whether ...Food allergens are mainly naturally-occurring proteins with immunoglobulin E(IgE)-binding epitopes.Understanding the structural and immunogenic characteristics of allergenic proteins is essential in assessing whether and how food processing techniques reduce allergenicity.We here discuss the impacts of food processing technologies on the modification of physicochemical,structural,and immunogenic properties of allergenic proteins.Detection techniques for characterizing changes in these properties of food allergens are summarized.Food processing helps to reduce allergenicity by aggregating or denaturing proteins,which masks,modifies,or destroys antigenic epitopes,whereas,it cannot eliminate allergenicity completely,and sometimes even improves allergenicity by exposing new epitopes.Moreover,most food processing techniques have been tested on purified food allergens rather than food products due to potential interference of other food components.We provide guidance for further development of processing operations that can decrease the allergenicity of allergenic food proteins without negatively impacting the nutritional profile.展开更多
Proteins perform a variety of functions in living organisms and their functions are largely determined by their shape. In this paper, we propose a novel mathematical method for designing protein-like molecules of a gi...Proteins perform a variety of functions in living organisms and their functions are largely determined by their shape. In this paper, we propose a novel mathematical method for designing protein-like molecules of a given shape. In the mathematical model, molecules are represented as loops of n-simplices (2-simplices are triangles and 3-simplices are tetrahedra). We design a new molecule of a given shape by patching together a set of smaller molecules that cover the shape. The covering set of small molecules is defined using a binary relation between sets of molecules. A new molecule is then obtained as a sum of the smaller molecules, where addition of molecules is defined using transformations acting on a set of (n + 1)-dimensional cones. Due to page limitations, only the two-dimensional case (i.e., loops of triangles) is considered. No prior knowledge of Sheaf Theory, Category Theory, or Protein Science is required. The author hopes that this paper will encourage further collaboration between Mathematics and Protein Science.展开更多
Proteolysis is one of the most important biochemical reactions during cheese ripening.Studies on the secondary structure of proteins during ripening would be helpful for characterizing protein changes for assessing ch...Proteolysis is one of the most important biochemical reactions during cheese ripening.Studies on the secondary structure of proteins during ripening would be helpful for characterizing protein changes for assessing cheese quality.Fourier transform infrared spectroscopy(FTIR),with self-deconvolution,second derivative analysis and band curve-fitting,was used to characterize the secondary structure of proteins in Cheddar cheese during ripening.The spectra of the amide I region showed great similarity,while the relative contents of the secondary structures underwent a series of changes.As ripening progressed,the α-helix content decreased and the β-sheet content increased.This structural shift was attributed to the strengthening of hydrogen bonds that resulted from hydrolysis of caseins.In summary,FTIR could provide the basis for rapid characterization of cheese that is undergoing ripening.展开更多
Background Dietary bamboo leaf flavonoids(BLFs)are rarely used in poultry production,and it is unknown whether they influence meat texture profile,perceived color,or microstructure.Results A total of 720 one-day-old A...Background Dietary bamboo leaf flavonoids(BLFs)are rarely used in poultry production,and it is unknown whether they influence meat texture profile,perceived color,or microstructure.Results A total of 720 one-day-old Arbor Acres broilers were supplemented with a basal diet with 20 mg bacitracin/kg,50 mg BLFs/kg,or 250 mg BLFs/kg or without additions.Data showed that the dietary BLFs significantly(P<0.05)changed growth performance and the texture profile.In particular,BLFs increased birds’average daily gain and average daily feed intake,decreased the feed:gain ratio and mortality rate,improved elasticity of breast meat,enhanced the gumminess of breast and leg meat,and decreased the hardness of breast meat.Moreover,a significant(P<0.05)increase in redness(a*)and chroma(c*)of breast meat and c*and water-holding capacity of leg meat was found in BLF-supplemented broilers compared with control broilers.In addition,BLFs supplementation significantly decreased(P<0.05)theβ-sheet ratio and serum malondialdehyde and increased theβ-turn ratio of protein secondary structure,superoxide dismutase,and glutathione peroxidase of breast meat and total antioxidant capacity and catalase of serum.Based on the analysis of untargeted metabolome,BLFs treatment considerably altered 14 metabolites of the breast meat,including flavonoids,amino acids,and organic acids,as well as phenolic and aromatic compounds.Conclusions Dietary BLFs supplementation could play a beneficial role in improving meat quality and sensory color in the poultry industry by changing protein secondary structures and modulating metabolites.展开更多
The secondary structure of a protein is critical for establishing a link between the protein primary and tertiary structures.For this reason,it is important to design methods for accurate protein secondary structure p...The secondary structure of a protein is critical for establishing a link between the protein primary and tertiary structures.For this reason,it is important to design methods for accurate protein secondary structure prediction.Most of the existing computational techniques for protein structural and functional prediction are based onmachine learning with shallowframeworks.Different deep learning architectures have already been applied to tackle protein secondary structure prediction problem.In this study,deep learning based models,i.e.,convolutional neural network and long short-term memory for protein secondary structure prediction were proposed.The input to proposed models is amino acid sequences which were derived from CulledPDB dataset.Hyperparameter tuning with cross validation was employed to attain best parameters for the proposed models.The proposed models enables effective processing of amino acids and attain approximately 87.05%and 87.47%Q3 accuracy of protein secondary structure prediction for convolutional neural network and long short-term memory models,respectively.展开更多
Protein structure prediction is one of the most essential objectives practiced by theoretical chemistry and bioinformatics as it is of a vital importance in medicine,biotechnology and more.Protein secondary structure ...Protein structure prediction is one of the most essential objectives practiced by theoretical chemistry and bioinformatics as it is of a vital importance in medicine,biotechnology and more.Protein secondary structure prediction(PSSP)has a significant role in the prediction of protein tertiary structure,as it bridges the gap between the protein primary sequences and tertiary structure prediction.Protein secondary structures are classified into two categories:3-state category and 8-state category.Predicting the 3 states and the 8 states of secondary structures from protein sequences are called the Q3 prediction and the Q8 prediction problems,respectively.The 8 classes of secondary structures reveal more precise structural information for a variety of applications than the 3 classes of secondary structures,however,Q8 prediction has been found to be very challenging,that is why all previous work done in PSSP have focused on Q3 prediction.In this paper,we develop an ensemble Machine Learning(ML)approach for Q8 PSSP to explore the performance of ensemble learning algorithms compared to that of individual ML algorithms in Q8 PSSP.The ensemble members considered for constructing the ensemble models are well known classifiers,namely SVM(Support Vector Machines),KNN(K-Nearest Neighbor),DT(Decision Tree),RF(Random Forest),and NB(Naïve Bayes),with two feature extraction techniques,namely LDA(Linear Discriminate Analysis)and PCA(Principal Component Analysis).Experiments have been conducted for evaluating the performance of single models and ensemble models,with PCA and LDA,in Q8 PSSP.The novelty of this paper lies in the introduction of ensemble learning in Q8 PSSP problem.The experimental results confirmed that ensemble ML models are more accurate than individual ML models.They also indicated that features extracted by LDA are more effective than those extracted by PCA.展开更多
The algorithm based on combination learning usually is superior to a singleclassification algorithm on the task of protein secondary structure prediction. However,the assignment of the weight of the base classifier us...The algorithm based on combination learning usually is superior to a singleclassification algorithm on the task of protein secondary structure prediction. However,the assignment of the weight of the base classifier usually lacks decision-makingevidence. In this paper, we propose a protein secondary structure prediction method withdynamic self-adaptation combination strategy based on entropy, where the weights areassigned according to the entropy of posterior probabilities outputted by base classifiers.The higher entropy value means a lower weight for the base classifier. The final structureprediction is decided by the weighted combination of posterior probabilities. Extensiveexperiments on CB513 dataset demonstrates that the proposed method outperforms theexisting methods, which can effectively improve the prediction performance.展开更多
The aim of the present study was to assess whether Fourier transform infrared spectrometry (FTIR) micro-spectroscopy could produce distinct spectral information on protein of old myocardial infarction (OMI) and to set...The aim of the present study was to assess whether Fourier transform infrared spectrometry (FTIR) micro-spectroscopy could produce distinct spectral information on protein of old myocardial infarction (OMI) and to set them as molecular markers to diagnose atypical OMI. Paraffin-embedded heart samples were derived from victims dying of OMI. In combination with histological stain, FTIR and infrared micro-spectroscopy, the characteristics of OMI were analyzed morphologically and molecularly. The most relevant bands identified were the amide A, B, Ⅰ and Ⅱ, showing crucial spectral differences between apparent normal region and OMI region, including the peak position blue shift and the increased intensity of OMI, moreover relative increase in α-helix and decrease in β-sheet of protein secondary structures in OMI. Comparing to single spectral band, the I1650/I1550 ratio was increased and rationally used as a molecular marker for diagnosing OMI. These novel preliminary findings supported further exploration of FTIR molecular profiling in clinical or forensic study, and were in accordance with histopathology.展开更多
Genes homologous to members of the MRP gene family in Caenorhabditis elegans are important in drug resistance.To further explore the molecular mechanism of drug resistance in pine wood nematode(Bursaphelenchus xylophi...Genes homologous to members of the MRP gene family in Caenorhabditis elegans are important in drug resistance.To further explore the molecular mechanism of drug resistance in pine wood nematode(Bursaphelenchus xylophilus),we used bioinformatics approaches to analyze genomic data for B.xylophilus and identified Bx-MRP genes.We predicted the structure and function of the genes and encoded proteins.Using bioinformatics programs to predict and analyze various properties of the predicted proteins,including hydrophobicity,transmembrane regions,phosphorylation sites,and topologically isomeric structures,of these Bx-MRP genes,we determined that they function in transmembrane transport.From the results of RT-qPCR,the Bx-MRP family members confer significant differential resistance to different drug treatments.After treatment with different concentrations of emamectin benzoate,avermectin and matrine,the expression of each gene increased with increasing drug concentrations,indicating that the family members play a positive role in the regulation of multidrug resistance.展开更多
The liver is the site of synthesis of the majority of circulating proteins.Besides initial polypeptide synthesis,sophisticated machinery is involved in the further processing of proteins by removing parts of them and/...The liver is the site of synthesis of the majority of circulating proteins.Besides initial polypeptide synthesis,sophisticated machinery is involved in the further processing of proteins by removing parts of them and/or adding functional groups and small molecules tailoring the final molecule to suit its physiological purpose.Posttranslational modifications(PTMs)design a network of molecules with the common protein ancestor but with slightly or considerably varying activity/localization/purpose.PTMs can change under pathological conditions,giving rise to aberrant or overmodified proteins.Undesired changes in the structure of proteins most often accompany undesired changes in their function,such as reduced activity or the appearance of new effects.Proper protein processing is essential for the reactions in living beings and crucial for the overall quality control.Modifications that occur on proteins synthesized in the liver whose PTMs are cirrhosis-related are oxidation,nitration,glycosylation,acetylation,and ubiquitination.Some of them predominantly affect proteins that remain in liver cells,whereas others predominantly occur on proteins that leave the liver or originate from other tissues and perform their function in the circulation.Altered PTMs of certain proteins are potential candidates as biomarkers of liver-related diseases,including cirrhosis.This review will focus on PTMs on proteins whose structural changes in cirrhosis exert or are suspected to exert the most serious functional consequences.展开更多
This paper proposes a novel application of cohomology to protein structure analysis. Since proteins interact each other by forming transient protein complexes, their shape (e.g., shape complementarity) plays an import...This paper proposes a novel application of cohomology to protein structure analysis. Since proteins interact each other by forming transient protein complexes, their shape (e.g., shape complementarity) plays an important role in their functions. In our mathematical toy models, proteins are represented as a loop of triangles (2D model) or tetrahedra (3D model), where their interactions are defined as fusion of loops. The purpose of this paper is to describe the conditions for loop fusion using the language of cohomology. In particular, this paper uses cohomology to describe the conditions for “allosteric regulation”, which has been attracted attention in safer drug discovery. I hope that this paper will provide a new perspective on the mechanism of allosteric regulation. Advantages of the model include its topological nature. That is, we can deform the shape of loops by deforming the shape of triangles (or tetrahedra) as long as their folded structures are preserved. Another advantage is the simplicity of the “allosteric regulation” mechanism of the model. Furthermore, the effect of the “post-translational modification” can be understood as a resolution of singularities of a flow of triangles (or tetrahedra). No prior knowledge of either protein science, exterior calculus, or cohomology theory is required. The author hopes that this paper will facilitate the interaction between mathematics and protein science.展开更多
In the present investigation the structural proteins associated with MAC-1 bacteriophage have been characterized using sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE);tandem mass spectrometry of p...In the present investigation the structural proteins associated with MAC-1 bacteriophage have been characterized using sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE);tandem mass spectrometry of protein bands from SDS-PAGE gel;from the open reading frames (ORFs) deduced from MAC-1 genome sequence and amino acid sequence homology searches from the Uniprot database (up000002418). Results have led to the identification of at least three structural proteins associated with MAC-1 phage genome. They are: capsid protein (~55,000-daltons);spike protein (~22,000-daltons) and a low molecular weight DNA binding protein (~4000-dal- tons). In addition, two other minor proteins were tentatively identified as replicative and scaffold proteins based on two to three unique peptides from mass spectrometry data. However, other proteins coded (ORFs) by phage genome remain to be identified.展开更多
The Newcastle disease virus(NDV)negative-strand RNA genome contains six genes.These genes encode nucleoprotein(NP),phosphoprotein(P),matrix protein(M),fusion protein(F),hemagglutinin-neuraminidase(HN),and RNA-dependen...The Newcastle disease virus(NDV)negative-strand RNA genome contains six genes.These genes encode nucleoprotein(NP),phosphoprotein(P),matrix protein(M),fusion protein(F),hemagglutinin-neuraminidase(HN),and RNA-dependent RNA polymerase(L)proteins.The six proteins affect the virulence of NDV in different ways,but available information on the six proteins is disparate and scattered across many databases and sources.A comprehensive overview of the proteins determining NDV virulence is lacking.This review summarizes the virulence of NDV as a complex trait determined by these six different proteins.展开更多
Mitogen-activated protein kinases(MAPKs,also known as MPKs)regulate diverse cellular and physiological functions,and dual-specificity MAPK phosphatases(MKPs)modulate MAPK signalling through MAPK dephosphorylation and ...Mitogen-activated protein kinases(MAPKs,also known as MPKs)regulate diverse cellular and physiological functions,and dual-specificity MAPK phosphatases(MKPs)modulate MAPK signalling through MAPK dephosphorylation and inactivation.Due to lacking of overall understanding for the regulatory networks between Chrysanthemum morifolium MKPs(CmMKPs)and C.morifolium MAPKs(CmMPKs),we systematically studied the interactions between four groups of CmMPKs and eight identified CmMKPs in chrysanthemum and found that the interaction between the specific CmMKP and the specific CmMPK differed from those in other plants.Furthermore,the expression of CmMKP1 and CmMKP1-LIKE1showed opposite trends during the development of chrysanthemum flower buds under salt treatment and Alternaria alternata inoculation,but these genes could interact with the same CmMPKs,providing insight into the subfunctionalization of paralogues.Amino acid variations(M87V,T277P and V6L)in dual-specificity protein phosphatases(DsPTP1)-LIKE1/2/3 changed the interactions of these proteins with the four CmMPK groups in chrysanthemum,providing evidence for the de/neofunctionalization of paralogues in polyploids,suggesting that we can identify the key functional sites of proteins by studying polyploid paralogues.展开更多
Available online Immunoglobulins G(IgGs)are Y-shaped globular proteins,however,their high flexibility and heterogeneity pose great challenges to their structure and conformation determinations.Geometric structure of I...Available online Immunoglobulins G(IgGs)are Y-shaped globular proteins,however,their high flexibility and heterogeneity pose great challenges to their structure and conformation determinations.Geometric structure of IgG closely correlates to its biofunctions,such as the antibody escape of human immunodeficiency virus(HIV)could attribute to the distance mismatch between the ends of two Fab arms(antigen-binding sites)and envelope glycoprotein spikes on virion surface.Herein,we report the first use of mobility capillary electrophoresis(MCE)and native mass spectrometry(nMS)to resolve the internal geometric structure and conformation of an IgG(trastuzumab)in solution phase.After proteolysis,the ellipsoid dimensions of IgG and its subunits were measured by MCE-nMS experiments.IgG was then reconstructed,in which the sizes and relative positions of these three subunits in three-dimensional space were characterized.It was found that the two Fab arms have an angle of~102.1°and a distance of~11.0 nm between the two antigen-binding sites under native condition,and the Fc arm was tilted~16.0°towards one of the Fab arms.Fc was not on the plane of Fab-Fab,but has an angle of no larger than 103.1°.Under acidic environment(pH 3.0),each subunit of the IgG would unfold into larger dimensions,and the angles between these subunits also change.With great potential for tumor imaging and therapy,the structure of F(ab')_(2)fragments was also measured and validated by molecular dynamic simulation.It was found that the electrostatic force among these three subunits and steric hindrance stemming from Fc help maintaining the angle between two Fab arms.展开更多
Objective:To understand the cause for the differences between potentially mild Southeast Asian and the more pathogenic ZIKV in South America.Methods:A comparative genomic analysis was performed to determine putative c...Objective:To understand the cause for the differences between potentially mild Southeast Asian and the more pathogenic ZIKV in South America.Methods:A comparative genomic analysis was performed to determine putative causations stemming from ZIKV.Results:Phylogcnctic analyses integrating geographical and time factors revealed that Southeast Asian ZIKV might not be the direct source of South American outbreaks as previously speculated.Amino acid residues unique to South American ZIKV isolates at the envelope,pr and NS1 proteins are listed and shown in the structural context.These unique residues on external viral proteins are not found in Southeast Asian ZIKV and could be responsible for the ongoing outbreak either via an intrinsic property of the virus or interactions with human immunity.Only a selected few primer/probe sets currently in clinical use were identified of being capable of detecting ZIKV strains worldwide.The envelope proteins of dengue virus(DENV) and ZIKV also showed a remarkable degree of similarity especially at the surface residues.Conclusions:findings that may help explain the cross-reactivity of DENV antibodies to ZIKV.Thus,major caveats must be exercised in using existing diagnostic tools for ZIKV.展开更多
A full-length lily-type lectin(Sm LTL) was identified from turbot(Scophthalmus maximus) in this study. By searching database for protein identification and function prediction, Sm LTL were confirmed. The full-length c...A full-length lily-type lectin(Sm LTL) was identified from turbot(Scophthalmus maximus) in this study. By searching database for protein identification and function prediction, Sm LTL were confirmed. The full-length c DNA of Sm LTL is composed of 569 bp and contains a 339 bp ORF that encodes 112 amino acid residues. The Sm LTL peptide is characterized by a specific β-prism architecture and contains three mannose binding sites in a three-fold internal repeat between amino acids 30–99; two of the repeats share the classical mannose binding domain(Qx Dx Nx Vx Y) while the third binding site was similar to other fish-specific binding motifs(Tx Tx Gx Rx V). The primary, secondary, and tertiary structures of Sm LTL were predicted and analyzed, indicating that the S m LTL protein was hydrophilic, contained 5.36% α-helices, 39.29% extended strands, 16.07% β-folds, and 39.29% random coils, and three β-folds. Quantitative realtime polymerase chain reaction(qPCR) analysis revealed that the Sm LTL mRNA was abundantly expressed in skin, gill, and intestine. Low levels of Sm LTL expression were observed in other tissues. The expression of Sm LTL in gill, skin and intestine increased at m RNA level after stimulation of V ibrio anguillarum, our results suggest that Sm LTL serve as the first line of defence against microbial infections and play a pivotal role in the innate mucosal immune system. The current study indicates that Sm LTL is a member of the lilytype lectin family and the information reported here will provide an important foundation for future research on the role of this protein.展开更多
文摘The research methods of protein structure prediction mainly focus on finding effective features of protein sequences and developing suitable machine learning algorithms. But few people consider the importance of weights of features in classification. We propose the GASVM algorithm (classification accuracy of support vector machine is regarded as the fitness value of genetic algorithm) to optimize the coefficients of these 16 features (5 features are proposed first time) in the classification, and further develop a new feature vector. Finally, based on the new feature vector, this paper uses support vector machine and 10-fold cross-validation to classify the protein structure of 3 low similarity datasets (25PDB, 1189, FC699). Experimental results show that the overall classification accuracy of the new method is better than other methods.
基金the financial support from the Postdoctoral Research Project of Heilongjiang Provincial Department of Human Resources and Social Security (LBH-Q21156)Heilongjiang BaYi Agricultural University Support Program for San Zong San Heng (ZDZX202104)+3 种基金Science Foundation Project of Heilongjiang Province (QC2015028)National Natural Science Foundation of China (32072258)Major Science and technology Program of Heilongjiang (2019ZX08B02,2020ZX08B02)Central financial support for the development of local colleges and universities,Graduate research and innovation project of Harbin University of Commerce (YJSCX2020636HSD)。
文摘It has been reported that fresh edible rice has more bioactive compounds and its protein is easier to digest and has lower hypoallergenic than mature rice. In this paper, the changes in structure and functional properties of proteins at five different stages, including early milky stage(EMS), middle milky stage(MMS), late milky stage(LMS), waxy ripe stage(WS)and ripening stage(RS), during the seed development were investigated. It was found that with the seed developing, the molecular weight of fresh rice protein gradually become larger while the secondary structure changed from the highest content of disordered structure at MMS to the highest content of ordered structure at RS, which affect the surface hydrophobicity and then the functional properties of proteins, including foaming properties, emulsifying properties and oil holding capacity. Fresh rice protein at MMS has the strongest surface hydrophobicity while fresh edible rice protein at RS has the strongest oil holding capability. The results of our study can provide a theoretical basis for the application of fresh rice protein in the food industry and help to develop new fresh edible rice food.
基金the National Key R&D Program of China(Grant No.2020YFA0907000)lthe National Natural Science Foundation of China(Grant Nos.32271297,62072435,31770775,and 31671369)for providing financial support for this study and publication charges.
文摘Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields,including biochemistry,medicine,physics,mathematics,and computer science.These researchers adopt various research paradigms to attack the same structure prediction problem:biochemists and physicists attempt to reveal the principles governing protein folding;mathematicians,especially statisticians,usually start from assuming a probability distribution of protein structures given a target sequence and then find the most likely structure,while computer scientists formulate protein structure prediction as an optimization problem-finding the structural conformation with the lowest energy or minimizing the difference between predicted structure and native structure.These research paradigms fall into the two statistical modeling cultures proposed by Leo Breiman,namely,data modeling and algorithmic modeling.Recently,we have also witnessed the great success of deep learning in protein structure prediction.In this review,we present a survey of the efforts for protein structure prediction.We compare the research paradigms adopted by researchers from different fields,with an emphasis on the shift of research paradigms in the era of deep learning.In short,the algorithmic modeling techniques,especially deep neural networks,have considerably improved the accuracy of protein structure prediction;however,theories interpreting the neural networks and knowledge on protein folding are still highly desired.
基金supported by the National Natural Science Foundation of China (32102605)the Agricultural Science and Technology Innovation Program under Grant (CAAS-ASTIP-2020IAR)the Earmarked Fund for CARS (CARS-44)。
文摘Food allergens are mainly naturally-occurring proteins with immunoglobulin E(IgE)-binding epitopes.Understanding the structural and immunogenic characteristics of allergenic proteins is essential in assessing whether and how food processing techniques reduce allergenicity.We here discuss the impacts of food processing technologies on the modification of physicochemical,structural,and immunogenic properties of allergenic proteins.Detection techniques for characterizing changes in these properties of food allergens are summarized.Food processing helps to reduce allergenicity by aggregating or denaturing proteins,which masks,modifies,or destroys antigenic epitopes,whereas,it cannot eliminate allergenicity completely,and sometimes even improves allergenicity by exposing new epitopes.Moreover,most food processing techniques have been tested on purified food allergens rather than food products due to potential interference of other food components.We provide guidance for further development of processing operations that can decrease the allergenicity of allergenic food proteins without negatively impacting the nutritional profile.
文摘Proteins perform a variety of functions in living organisms and their functions are largely determined by their shape. In this paper, we propose a novel mathematical method for designing protein-like molecules of a given shape. In the mathematical model, molecules are represented as loops of n-simplices (2-simplices are triangles and 3-simplices are tetrahedra). We design a new molecule of a given shape by patching together a set of smaller molecules that cover the shape. The covering set of small molecules is defined using a binary relation between sets of molecules. A new molecule is then obtained as a sum of the smaller molecules, where addition of molecules is defined using transformations acting on a set of (n + 1)-dimensional cones. Due to page limitations, only the two-dimensional case (i.e., loops of triangles) is considered. No prior knowledge of Sheaf Theory, Category Theory, or Protein Science is required. The author hopes that this paper will encourage further collaboration between Mathematics and Protein Science.
基金financially supported by Beijing Municipal Commission of Education Co-Constructed Programand Chinese Universities Scientific Fund(2009-4-25)
文摘Proteolysis is one of the most important biochemical reactions during cheese ripening.Studies on the secondary structure of proteins during ripening would be helpful for characterizing protein changes for assessing cheese quality.Fourier transform infrared spectroscopy(FTIR),with self-deconvolution,second derivative analysis and band curve-fitting,was used to characterize the secondary structure of proteins in Cheddar cheese during ripening.The spectra of the amide I region showed great similarity,while the relative contents of the secondary structures underwent a series of changes.As ripening progressed,the α-helix content decreased and the β-sheet content increased.This structural shift was attributed to the strengthening of hydrogen bonds that resulted from hydrolysis of caseins.In summary,FTIR could provide the basis for rapid characterization of cheese that is undergoing ripening.
基金supported by the National Natural Science Foundation of China(No.32002195)Zhejiang Provincial Leading Innovation and Entrepreneurship Team Project(No.2020R01015)+1 种基金“Leading Geese”Research and Development Plan of Zhejiang Province(No.2022C02059)Key R&D Projects of Zhejiang Province(No.2021C02013)。
文摘Background Dietary bamboo leaf flavonoids(BLFs)are rarely used in poultry production,and it is unknown whether they influence meat texture profile,perceived color,or microstructure.Results A total of 720 one-day-old Arbor Acres broilers were supplemented with a basal diet with 20 mg bacitracin/kg,50 mg BLFs/kg,or 250 mg BLFs/kg or without additions.Data showed that the dietary BLFs significantly(P<0.05)changed growth performance and the texture profile.In particular,BLFs increased birds’average daily gain and average daily feed intake,decreased the feed:gain ratio and mortality rate,improved elasticity of breast meat,enhanced the gumminess of breast and leg meat,and decreased the hardness of breast meat.Moreover,a significant(P<0.05)increase in redness(a*)and chroma(c*)of breast meat and c*and water-holding capacity of leg meat was found in BLF-supplemented broilers compared with control broilers.In addition,BLFs supplementation significantly decreased(P<0.05)theβ-sheet ratio and serum malondialdehyde and increased theβ-turn ratio of protein secondary structure,superoxide dismutase,and glutathione peroxidase of breast meat and total antioxidant capacity and catalase of serum.Based on the analysis of untargeted metabolome,BLFs treatment considerably altered 14 metabolites of the breast meat,including flavonoids,amino acids,and organic acids,as well as phenolic and aromatic compounds.Conclusions Dietary BLFs supplementation could play a beneficial role in improving meat quality and sensory color in the poultry industry by changing protein secondary structures and modulating metabolites.
文摘The secondary structure of a protein is critical for establishing a link between the protein primary and tertiary structures.For this reason,it is important to design methods for accurate protein secondary structure prediction.Most of the existing computational techniques for protein structural and functional prediction are based onmachine learning with shallowframeworks.Different deep learning architectures have already been applied to tackle protein secondary structure prediction problem.In this study,deep learning based models,i.e.,convolutional neural network and long short-term memory for protein secondary structure prediction were proposed.The input to proposed models is amino acid sequences which were derived from CulledPDB dataset.Hyperparameter tuning with cross validation was employed to attain best parameters for the proposed models.The proposed models enables effective processing of amino acids and attain approximately 87.05%and 87.47%Q3 accuracy of protein secondary structure prediction for convolutional neural network and long short-term memory models,respectively.
文摘Protein structure prediction is one of the most essential objectives practiced by theoretical chemistry and bioinformatics as it is of a vital importance in medicine,biotechnology and more.Protein secondary structure prediction(PSSP)has a significant role in the prediction of protein tertiary structure,as it bridges the gap between the protein primary sequences and tertiary structure prediction.Protein secondary structures are classified into two categories:3-state category and 8-state category.Predicting the 3 states and the 8 states of secondary structures from protein sequences are called the Q3 prediction and the Q8 prediction problems,respectively.The 8 classes of secondary structures reveal more precise structural information for a variety of applications than the 3 classes of secondary structures,however,Q8 prediction has been found to be very challenging,that is why all previous work done in PSSP have focused on Q3 prediction.In this paper,we develop an ensemble Machine Learning(ML)approach for Q8 PSSP to explore the performance of ensemble learning algorithms compared to that of individual ML algorithms in Q8 PSSP.The ensemble members considered for constructing the ensemble models are well known classifiers,namely SVM(Support Vector Machines),KNN(K-Nearest Neighbor),DT(Decision Tree),RF(Random Forest),and NB(Naïve Bayes),with two feature extraction techniques,namely LDA(Linear Discriminate Analysis)and PCA(Principal Component Analysis).Experiments have been conducted for evaluating the performance of single models and ensemble models,with PCA and LDA,in Q8 PSSP.The novelty of this paper lies in the introduction of ensemble learning in Q8 PSSP problem.The experimental results confirmed that ensemble ML models are more accurate than individual ML models.They also indicated that features extracted by LDA are more effective than those extracted by PCA.
文摘The algorithm based on combination learning usually is superior to a singleclassification algorithm on the task of protein secondary structure prediction. However,the assignment of the weight of the base classifier usually lacks decision-makingevidence. In this paper, we propose a protein secondary structure prediction method withdynamic self-adaptation combination strategy based on entropy, where the weights areassigned according to the entropy of posterior probabilities outputted by base classifiers.The higher entropy value means a lower weight for the base classifier. The final structureprediction is decided by the weighted combination of posterior probabilities. Extensiveexperiments on CB513 dataset demonstrates that the proposed method outperforms theexisting methods, which can effectively improve the prediction performance.
文摘The aim of the present study was to assess whether Fourier transform infrared spectrometry (FTIR) micro-spectroscopy could produce distinct spectral information on protein of old myocardial infarction (OMI) and to set them as molecular markers to diagnose atypical OMI. Paraffin-embedded heart samples were derived from victims dying of OMI. In combination with histological stain, FTIR and infrared micro-spectroscopy, the characteristics of OMI were analyzed morphologically and molecularly. The most relevant bands identified were the amide A, B, Ⅰ and Ⅱ, showing crucial spectral differences between apparent normal region and OMI region, including the peak position blue shift and the increased intensity of OMI, moreover relative increase in α-helix and decrease in β-sheet of protein secondary structures in OMI. Comparing to single spectral band, the I1650/I1550 ratio was increased and rationally used as a molecular marker for diagnosing OMI. These novel preliminary findings supported further exploration of FTIR molecular profiling in clinical or forensic study, and were in accordance with histopathology.
基金supported by the National Key Research and Development Program of China(2017YFD0600101).
文摘Genes homologous to members of the MRP gene family in Caenorhabditis elegans are important in drug resistance.To further explore the molecular mechanism of drug resistance in pine wood nematode(Bursaphelenchus xylophilus),we used bioinformatics approaches to analyze genomic data for B.xylophilus and identified Bx-MRP genes.We predicted the structure and function of the genes and encoded proteins.Using bioinformatics programs to predict and analyze various properties of the predicted proteins,including hydrophobicity,transmembrane regions,phosphorylation sites,and topologically isomeric structures,of these Bx-MRP genes,we determined that they function in transmembrane transport.From the results of RT-qPCR,the Bx-MRP family members confer significant differential resistance to different drug treatments.After treatment with different concentrations of emamectin benzoate,avermectin and matrine,the expression of each gene increased with increasing drug concentrations,indicating that the family members play a positive role in the regulation of multidrug resistance.
基金Supported by the Ministry of Education,Science and Technological Development of the Republic of Serbia,No.451-03-9/2021-14/200019.
文摘The liver is the site of synthesis of the majority of circulating proteins.Besides initial polypeptide synthesis,sophisticated machinery is involved in the further processing of proteins by removing parts of them and/or adding functional groups and small molecules tailoring the final molecule to suit its physiological purpose.Posttranslational modifications(PTMs)design a network of molecules with the common protein ancestor but with slightly or considerably varying activity/localization/purpose.PTMs can change under pathological conditions,giving rise to aberrant or overmodified proteins.Undesired changes in the structure of proteins most often accompany undesired changes in their function,such as reduced activity or the appearance of new effects.Proper protein processing is essential for the reactions in living beings and crucial for the overall quality control.Modifications that occur on proteins synthesized in the liver whose PTMs are cirrhosis-related are oxidation,nitration,glycosylation,acetylation,and ubiquitination.Some of them predominantly affect proteins that remain in liver cells,whereas others predominantly occur on proteins that leave the liver or originate from other tissues and perform their function in the circulation.Altered PTMs of certain proteins are potential candidates as biomarkers of liver-related diseases,including cirrhosis.This review will focus on PTMs on proteins whose structural changes in cirrhosis exert or are suspected to exert the most serious functional consequences.
文摘This paper proposes a novel application of cohomology to protein structure analysis. Since proteins interact each other by forming transient protein complexes, their shape (e.g., shape complementarity) plays an important role in their functions. In our mathematical toy models, proteins are represented as a loop of triangles (2D model) or tetrahedra (3D model), where their interactions are defined as fusion of loops. The purpose of this paper is to describe the conditions for loop fusion using the language of cohomology. In particular, this paper uses cohomology to describe the conditions for “allosteric regulation”, which has been attracted attention in safer drug discovery. I hope that this paper will provide a new perspective on the mechanism of allosteric regulation. Advantages of the model include its topological nature. That is, we can deform the shape of loops by deforming the shape of triangles (or tetrahedra) as long as their folded structures are preserved. Another advantage is the simplicity of the “allosteric regulation” mechanism of the model. Furthermore, the effect of the “post-translational modification” can be understood as a resolution of singularities of a flow of triangles (or tetrahedra). No prior knowledge of either protein science, exterior calculus, or cohomology theory is required. The author hopes that this paper will facilitate the interaction between mathematics and protein science.
文摘In the present investigation the structural proteins associated with MAC-1 bacteriophage have been characterized using sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE);tandem mass spectrometry of protein bands from SDS-PAGE gel;from the open reading frames (ORFs) deduced from MAC-1 genome sequence and amino acid sequence homology searches from the Uniprot database (up000002418). Results have led to the identification of at least three structural proteins associated with MAC-1 phage genome. They are: capsid protein (~55,000-daltons);spike protein (~22,000-daltons) and a low molecular weight DNA binding protein (~4000-dal- tons). In addition, two other minor proteins were tentatively identified as replicative and scaffold proteins based on two to three unique peptides from mass spectrometry data. However, other proteins coded (ORFs) by phage genome remain to be identified.
文摘The Newcastle disease virus(NDV)negative-strand RNA genome contains six genes.These genes encode nucleoprotein(NP),phosphoprotein(P),matrix protein(M),fusion protein(F),hemagglutinin-neuraminidase(HN),and RNA-dependent RNA polymerase(L)proteins.The six proteins affect the virulence of NDV in different ways,but available information on the six proteins is disparate and scattered across many databases and sources.A comprehensive overview of the proteins determining NDV virulence is lacking.This review summarizes the virulence of NDV as a complex trait determined by these six different proteins.
基金funded by the National Natural Science Foundation of China(Grant Nos.31870694,31870279,32002083)the Natural Science Fund of Jiangsu Province(Grant No.BK20190076)+2 种基金the National Key R&D Program of China(Grant No.2018YFD1000403)the Foundation of Central Laboratory of Xinyang Agriculture and Forestry University(Grant No.FCL202002)a project funded by the Priority Academic Program Development of Jiangsu Higher Education Institution。
文摘Mitogen-activated protein kinases(MAPKs,also known as MPKs)regulate diverse cellular and physiological functions,and dual-specificity MAPK phosphatases(MKPs)modulate MAPK signalling through MAPK dephosphorylation and inactivation.Due to lacking of overall understanding for the regulatory networks between Chrysanthemum morifolium MKPs(CmMKPs)and C.morifolium MAPKs(CmMPKs),we systematically studied the interactions between four groups of CmMPKs and eight identified CmMKPs in chrysanthemum and found that the interaction between the specific CmMKP and the specific CmMPK differed from those in other plants.Furthermore,the expression of CmMKP1 and CmMKP1-LIKE1showed opposite trends during the development of chrysanthemum flower buds under salt treatment and Alternaria alternata inoculation,but these genes could interact with the same CmMPKs,providing insight into the subfunctionalization of paralogues.Amino acid variations(M87V,T277P and V6L)in dual-specificity protein phosphatases(DsPTP1)-LIKE1/2/3 changed the interactions of these proteins with the four CmMPK groups in chrysanthemum,providing evidence for the de/neofunctionalization of paralogues in polyploids,suggesting that we can identify the key functional sites of proteins by studying polyploid paralogues.
基金supported by Ministry of Science and Technology of the People’s Republic of China instrumentation program(No.2020YFF01014502)NNSFC(No.21827810)Beijing Institute of Technology(No.2021CX006)。
文摘Available online Immunoglobulins G(IgGs)are Y-shaped globular proteins,however,their high flexibility and heterogeneity pose great challenges to their structure and conformation determinations.Geometric structure of IgG closely correlates to its biofunctions,such as the antibody escape of human immunodeficiency virus(HIV)could attribute to the distance mismatch between the ends of two Fab arms(antigen-binding sites)and envelope glycoprotein spikes on virion surface.Herein,we report the first use of mobility capillary electrophoresis(MCE)and native mass spectrometry(nMS)to resolve the internal geometric structure and conformation of an IgG(trastuzumab)in solution phase.After proteolysis,the ellipsoid dimensions of IgG and its subunits were measured by MCE-nMS experiments.IgG was then reconstructed,in which the sizes and relative positions of these three subunits in three-dimensional space were characterized.It was found that the two Fab arms have an angle of~102.1°and a distance of~11.0 nm between the two antigen-binding sites under native condition,and the Fc arm was tilted~16.0°towards one of the Fab arms.Fc was not on the plane of Fab-Fab,but has an angle of no larger than 103.1°.Under acidic environment(pH 3.0),each subunit of the IgG would unfold into larger dimensions,and the angles between these subunits also change.With great potential for tumor imaging and therapy,the structure of F(ab')_(2)fragments was also measured and validated by molecular dynamic simulation.It was found that the electrostatic force among these three subunits and steric hindrance stemming from Fc help maintaining the angle between two Fab arms.
基金supported by Thailand Research Fund-Mahidol University(RSA5880062)by the Office of the Higher Education Commission and Mahidol University under the National Research Universities Initiativesupported by the Royal Golden Jubilee Ph.D.Program (Grant No.PHD/0204/2552)
文摘Objective:To understand the cause for the differences between potentially mild Southeast Asian and the more pathogenic ZIKV in South America.Methods:A comparative genomic analysis was performed to determine putative causations stemming from ZIKV.Results:Phylogcnctic analyses integrating geographical and time factors revealed that Southeast Asian ZIKV might not be the direct source of South American outbreaks as previously speculated.Amino acid residues unique to South American ZIKV isolates at the envelope,pr and NS1 proteins are listed and shown in the structural context.These unique residues on external viral proteins are not found in Southeast Asian ZIKV and could be responsible for the ongoing outbreak either via an intrinsic property of the virus or interactions with human immunity.Only a selected few primer/probe sets currently in clinical use were identified of being capable of detecting ZIKV strains worldwide.The envelope proteins of dengue virus(DENV) and ZIKV also showed a remarkable degree of similarity especially at the surface residues.Conclusions:findings that may help explain the cross-reactivity of DENV antibodies to ZIKV.Thus,major caveats must be exercised in using existing diagnostic tools for ZIKV.
基金Supported by the Earmarked Fund for Modern Agro-Industry Technology Research System(No.CARS-50-G01)the General Financial Grant from the China Postdoctoral Science Foundation(No.2015-2016)+4 种基金the Special Financial Grant from the China Postdoctoral Science Foundation(No.2016T90661)the Shandong Provincial Natural Science Foundation(No.ZR2014CP001)the National High Technology Research and Development Program of China(863 Program)(No.2012AA10A408-8)the Primary Research&Developement Plan of Shandong Province(No.2016GSF115019)the Shandong Agriculture Seed Project(No.2016LZGC031)
文摘A full-length lily-type lectin(Sm LTL) was identified from turbot(Scophthalmus maximus) in this study. By searching database for protein identification and function prediction, Sm LTL were confirmed. The full-length c DNA of Sm LTL is composed of 569 bp and contains a 339 bp ORF that encodes 112 amino acid residues. The Sm LTL peptide is characterized by a specific β-prism architecture and contains three mannose binding sites in a three-fold internal repeat between amino acids 30–99; two of the repeats share the classical mannose binding domain(Qx Dx Nx Vx Y) while the third binding site was similar to other fish-specific binding motifs(Tx Tx Gx Rx V). The primary, secondary, and tertiary structures of Sm LTL were predicted and analyzed, indicating that the S m LTL protein was hydrophilic, contained 5.36% α-helices, 39.29% extended strands, 16.07% β-folds, and 39.29% random coils, and three β-folds. Quantitative realtime polymerase chain reaction(qPCR) analysis revealed that the Sm LTL mRNA was abundantly expressed in skin, gill, and intestine. Low levels of Sm LTL expression were observed in other tissues. The expression of Sm LTL in gill, skin and intestine increased at m RNA level after stimulation of V ibrio anguillarum, our results suggest that Sm LTL serve as the first line of defence against microbial infections and play a pivotal role in the innate mucosal immune system. The current study indicates that Sm LTL is a member of the lilytype lectin family and the information reported here will provide an important foundation for future research on the role of this protein.