Data Availability

isrdo-SRJSET

Scientific Research Journal of Science, Engineering and Technology

SRJSET

2584-0584

ISRDO

Gujarat, India

M-10000

Computer Science and Engineering

Use Apriori, Genetic Algorithm and Fuzzy Logic to Foretell the Most Common Amino Acid Sequence

Krupali Patel

S P University, India

Pravinbhai Patel

S P University, India

02112022

V1-I1-2023

1110202201110111

2022

Krupali Patel

This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (ISRDO) and either DOI or URL of the article must be cited.

Data mining is the practice of discovering connections between seemingly unrelated pieces of biological information. Rapid progress in genomics and proteomics in recent years has produced an abundance of biological data, so categorising biological sequences and structures according to their essential properties and functions is a pressing issue in biological data processing. Many methods reported in the literature generate frequent patterns for use in a wide range of contexts. However, the frequent patterns these methods generate are reduced in number, which limits their usefulness. We therefore use two different methods to compare the frequent patterns and optimise the data, which we find to be of great value. Contaminated protein sequences are the root cause of several human illnesses, and our method is designed to extract the amino acids that are both hidden and most dominant in a sequence. We address this problem by combining the Apriori algorithm, a genetic algorithm, and strong association rules for pattern prediction, and we apply fuzzy logic to optimise the data and identify interesting frequent patterns in the protein sequence database. Such recurring patterns are useful in the pharmaceutical industry.
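As an illustration only, the Apriori and association-rule steps named in the abstract can be sketched in a few lines of Python. This is not the authors' implementation (the work reports using RapidMiner), and it does not cover the genetic algorithm or the fuzzy logic stage. The function names `apriori` and `confidence`, the toy sequences, and the 0.75 support threshold are all assumptions made for the example. Each sequence is treated as the set of amino acids (one-letter codes) it contains.

```python
from itertools import combinations


def apriori(transactions, min_support):
    """Return {itemset: support} for every itemset with support >= min_support."""
    sets = [frozenset(t) for t in transactions]
    n = len(sets)

    def support(itemset):
        # Fraction of transactions that contain every item of the itemset.
        return sum(itemset <= s for s in sets) / n

    # Level 1: frequent single items.
    current = {}
    for item in sorted({i for s in sets for i in s}):
        candidate = frozenset([item])
        sup = support(candidate)
        if sup >= min_support:
            current[candidate] = sup
    frequent = dict(current)

    k = 2
    while current:
        prev = list(current)
        # Join step: unions of two frequent (k-1)-itemsets that have exactly k items.
        candidates = {a | b for a, b in combinations(prev, 2) if len(a | b) == k}
        # Prune step (Apriori property): every (k-1)-subset must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(sub) in current for sub in combinations(c, k - 1))
        }
        current = {}
        for c in candidates:
            sup = support(c)
            if sup >= min_support:
                current[c] = sup
        frequent.update(current)
        k += 1
    return frequent


def confidence(frequent, antecedent, consequent):
    """Confidence of the rule antecedent -> consequent, from stored supports."""
    a = frozenset(antecedent)
    return frequent[a | frozenset(consequent)] / frequent[a]


# Hypothetical toy sequences (assumption: not data from the article).
sequences = ["MKVLA", "MKVA", "MKLG", "KVLA"]
result = apriori(sequences, min_support=0.75)

for itemset, sup in sorted(result.items(), key=lambda kv: (len(kv[0]), sorted(kv[0]))):
    print("".join(sorted(itemset)), sup)
```

With these toy inputs, `confidence(result, "KV", "A")` gives the confidence of the rule {K, V} -> {A}; a rule is usually called strong when both its support and its confidence exceed chosen thresholds.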

Genetic Algorithms; Protein structure analysis; Association methods; Fuzzy Systems for mining biological data

Data Availability

Data sharing is not relevant to this topic since no datasets were created or analyzed over the course of this investigation.

Conflicts of Interest

Each author confirms that they have no competing interests.

Authors’ Contributions

Krupali Patel participated in the conception and execution of the study, the analysis and interpretation of the data, and the drafting of the paper. Pravinbhai Patel supervised the work on this project.

Funding Statement

No funding was provided to the author(s) of this article during its research, writing, or publishing.

Software Information

RapidMiner was used for this work.

Acknowledgments

I owe a great debt of gratitude to Pravinbhai, my primary supervisor, who gave me invaluable direction throughout this endeavour. I would also like to thank the friends and family members who helped me during this process and provided valuable feedback and insights.

Jiawei Han, Hong Cheng, Dong Xin and Xifeng Yan, “Frequent pattern mining: current status and future directions”, Data Mining and Knowledge Discovery (2007) 15:55–86.

Lakshmi Priya G. and Shanmugasundaram Hariharan, “A Study on Predicting Patterns over the Protein Sequence Datasets Using Association Rule Mining”, Journal of Engineering Science and Technology, Vol. 7, No. 5 (2012) 563–573.

Davnah Urbach and Jason H. Moore, “Data Mining and the Evolution of Biological Complexity”, BioData Mining (2011) 4:7.