How to get predication value for an instance in we

2019-01-14 09:22发布

问题:

I am working on Weka and need to output the predication values (probabilities) of each labels for each test instance.

In GUI there is an option in classify tab as (classify -> options -> Output predicted value) which does this work by outputting the prediction probabilities for each label but how to do this in java code. I want to receive probability scores for each label after classifying it ?

回答1:

The following code takes in a set of training instances, and outputs the predicted probability for a specific instance.


import weka.classifiers.trees.J48;
import weka.core.Instances;

public class Main {

    public static void main(String[] args) throws Exception
    {
        //load training instances
        Instances test=...

        //build a J48 decision tree
        J48 model=new J48(); 
        model.buildClassifier(test);

        //decide which instance you want to predict
        int s1=2;

        //get the predicted probabilities 
        double[] prediction=model.distributionForInstance(test.get(s1));

        //output predictions
        for(int i=0; i<prediction.length; i=i+1)
        {
            System.out.println("Probability of class "+
                                test.classAttribute().value(i)+
                               " : "+Double.toString(prediction[i]));
        }

    }

}

The method "distributionForInstance" only works for classifiers capable of outputting distribution predictions. You can read up on it here.



回答2:

I think I found the solution.

The training set and the test set must be equal: same header, same name of attributes, same order. Only changes the numbers. And the question is: why do I have to put the class in the test set if I don’t know it, and precisely it is what I want to obtain? It seems that the method needs something on that, but when you have a look at classModel.distributionForInstance(dataModel.instance(0)) , it gives you the prediction on your classes with an array of double. So, it is necessary to put some values of the classes in the test set, and later the ‘distributionForInstance’ gives you the real result for your classes.



回答3:

From WEKA GUI, Classify panel -> press the "More options..." -> Output predictions -> Choose "PlainText" option. Now, left-click on "PlainText" and turn the "outputDistribution" into "True".

Note that, this process can be performed in last WEKA versions, e.g., WEKA 3.8.0.

Regards,
Martin



标签: java weka