Hey there! I need help implementing a naive Bayes text classification algorithm in Java, to check a dataset for research purposes. The algorithm has to be implemented in Java, rather than run through tools such as Weka or RapidMiner to get the results.
My dataset has the following structure:
Doc Words Category
That is, each training document consists of its words and a category (a string) that is known in advance. Some sample rows are listed below:
Doc  Words                                                                Category
Training
1    Integration Communities Process Oriented Structures...(more string)  A
2    Integration Communities Process Oriented Structures...(more string)  A
3    Theory Upper Bound Routing Estimate global routing...(more string)   B
4    Hardware Design Functional Programming Perfect Match...(more string) C
. . .
Test
5    Methodology Toolkit Integrate Technological Organisational
6    This test contain string naive bayes test text text test
The dataset comes from a MySQL database and may contain many training rows and test rows. All I need is to run a naive Bayes text classification algorithm on it in Java.
The algorithm should follow the example in Table 13.1.
Source: Read here.
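For reference, here is a minimal sketch of what a hand-written version might look like. It assumes the multinomial model with add-one (Laplace) smoothing, which may or may not match the referenced example exactly. The class name NaiveBayesSketch and its train/classify methods are hypothetical, and only the words visible in the sample rows are used (the elided "...(more string)" parts are left out).

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

// Hypothetical sketch of multinomial naive Bayes with add-one smoothing.
class NaiveBayesSketch {
    private final Map<String, Integer> docsPerCategory = new HashMap<>();
    private final Map<String, Map<String, Integer>> wordCounts = new HashMap<>();
    private final Map<String, Integer> totalWords = new HashMap<>();
    private final Set<String> vocabulary = new HashSet<>();
    private int totalDocs = 0;

    // Assumption: words are separated by whitespace and case is ignored.
    private static String[] tokenize(String text) {
        return text.toLowerCase().trim().split("\\s+");
    }

    // Add one labelled training document to the counts.
    void train(String words, String category) {
        totalDocs++;
        docsPerCategory.merge(category, 1, Integer::sum);
        Map<String, Integer> counts =
                wordCounts.computeIfAbsent(category, k -> new HashMap<>());
        for (String w : tokenize(words)) {
            if (w.isEmpty()) continue;
            counts.merge(w, 1, Integer::sum);
            totalWords.merge(category, 1, Integer::sum);
            vocabulary.add(w);
        }
    }

    // Return the category with the highest log-probability score.
    String classify(String words) {
        String best = null;
        double bestScore = Double.NEGATIVE_INFINITY;
        for (String category : docsPerCategory.keySet()) {
            // Log prior: share of training documents in this category.
            double score = Math.log(docsPerCategory.get(category) / (double) totalDocs);
            Map<String, Integer> counts = wordCounts.get(category);
            int denom = totalWords.getOrDefault(category, 0) + vocabulary.size();
            for (String w : tokenize(words)) {
                if (!vocabulary.contains(w)) continue; // words never seen in training are skipped
                int count = counts.getOrDefault(w, 0);
                score += Math.log((count + 1) / (double) denom); // add-one smoothing
            }
            if (score > bestScore) {
                bestScore = score;
                best = category;
            }
        }
        return best;
    }

    public static void main(String[] args) {
        NaiveBayesSketch nb = new NaiveBayesSketch();
        // Only the visible part of each sample training row is used here.
        nb.train("Integration Communities Process Oriented Structures", "A");
        nb.train("Integration Communities Process Oriented Structures", "A");
        nb.train("Theory Upper Bound Routing Estimate global routing", "B");
        nb.train("Hardware Design Functional Programming Perfect Match", "C");
        System.out.println(nb.classify("Methodology Toolkit Integrate Technological Organisational"));
        System.out.println(nb.classify("This test contain string naive bayes test text text test"));
    }
}
```

With only the visible words, neither test row shares a word with the training vocabulary, so the score reduces to the category prior. With the full strings from the database the word likelihoods would come into play. Working in log space avoids numeric underflow on long documents.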
I could implement the algorithm in Java myself, but I would like to know whether there is a Java library, with source code and documentation, that would let me simply check the results.
I only need the results once, just to verify them.
So, to get to the point: can anyone recommend a good Java library that implements this algorithm and can process my dataset, or suggest an easy way to do it?
I would be grateful for your help. Thanks in advance.