Genetic based method for Mining Association Rules from Text
Main Article Content
Abstract
In general, text mining is the process of discovering very important and unknown information from a semi-structured or unstructured data set, while data mining deals with structured data. Where natural language processing technology was used: coding, stop word deletion, derivation and indexing of the document, convert text into structured form, and in order to extract correlation rules, we used the genetic algorithm (GA) to solve the correlation rule problem by overcoming the limitations of using the priori algorithm (a coding system, storage capacity). Were the estimates are based or based on a recommendation to use the individual variable length within the population. The rules for extracted relationships contain the essential features included in a document compilation. The proposed system aims to solve the problems facing traditional techniques as it works to address two main issues: Computational complexity, where the number of rules increases greatly with the number of elements in the database. Second, the rules related to interests must be collected within the place of creation of the rules that usually suggest a higher algorithm. The use of a genetic algorithm to find a relationship in the unorganized document between keywords is powerful and very effective in solving this problem because it effectively cuts the search space for threshold based consistency measurements.
Downloads
Metrics
Article Details
Licensing
TURCOMAT publishes articles under the Creative Commons Attribution 4.0 International License (CC BY 4.0). This licensing allows for any use of the work, provided the original author(s) and source are credited, thereby facilitating the free exchange and use of research for the advancement of knowledge.
Detailed Licensing Terms
Attribution (BY): Users must give appropriate credit, provide a link to the license, and indicate if changes were made. Users may do so in any reasonable manner, but not in any way that suggests the licensor endorses them or their use.
No Additional Restrictions: Users may not apply legal terms or technological measures that legally restrict others from doing anything the license permits.