Giter VIP home page Giter VIP logo

Comments (10)

fracpete avatar fracpete commented on September 22, 2024

Maybe the updating didn't quite work... Here are my steps:

  • download the latest release (at the time of writing 2015.1.13)
  • remove the existing collective-classification package (I simply removed the $HOME/wekafiles/packages/collective-classification directory)
  • fire up Weka and open the Package manager
  • install the new ZIP file that you downloaded
  • restart Weka and open the Explorer

This is what the Explorer then looks like:

collective_unlabeled_test_set

from collective-classification-weka-package.

petrjirasek avatar petrjirasek commented on September 22, 2024

@fracpete Can I ask you about the settings? I don't want to open a new issue. Thank you.

I am wondering how to set train, unlabeled and test set correctly. For instance, I have a dataset of 300 instances and I would like to test how good are in my case some collective classifiers.

I have created labeled train set 60 instances, 240 unlabeled instances as unlabeled set and the same 240 instances as test set, but the weka returns alert Attributes differ in attribute 2. Different number of labels 0 != 2 when I set it.

So, it means that unlabeled set must have labeled instances which are during building a training model ignored? In my case, the unlabeled test and test set will be probably the same, right?

Thank you for your help!
Petr

from collective-classification-weka-package.

fracpete avatar fracpete commented on September 22, 2024

I presume that you're using ARFF files. All your files, train/unlabeled/test, must have exactly the same structure (same attribute names, types, order of labels). Just make sure that your ARFF header for the unlabeled dataset also lists the 2 labels that your test set has. But you don't need to have labels in the unlabeled dataset, simply use the question mark for denoting a missing value instead. Yes, during build time any class labels from the unlabeled dataset get ignored. This allows you to use a test set also as unlabeled dataset.

from collective-classification-weka-package.

petrjirasek avatar petrjirasek commented on September 22, 2024

Thank you. It really helps. Sorry for interrupting but I would like to ask you one more question. I have tried the output option and set saver to CSV and set a output file file located in Desktop folder. But the saver after evaluating does not save anything. It's really weird. But uutput in classify tab for classifiers works well. Thank you,

from collective-classification-weka-package.

fracpete avatar fracpete commented on September 22, 2024

That was a nice bug. It worked for cross-validation, but not for the other test modes. I've fixed that and made a new released available:
https://github.com/fracpete/collective-classification-weka-package/releases/tag/v2015.2.27
Thanks for reporting the bug!

from collective-classification-weka-package.

petrjirasek avatar petrjirasek commented on September 22, 2024

Thanks for fast fix. I have just tried it and there is some problem in evaluation but it can be my mistake. I have created train set, unlabeled set and test set and try to evaluate it. But weka returns an error during processing - Problem evaluating - Cannot find instance (you can see it in the screen).
weka screen
I have uploaded my sets to dropbox here:
https://www.dropbox.com/sh/5afxvdtltkb8brc/AADN__F4ZCMpCDol_Dd8WqNFa?dl=0

I have checked count of attrbutes and other things and it's the same in all sets. I used FilteredCollectiveClassifier, where classifier is set to NaivebayesMultinomial in YATSI and filter is set to Remove -R first
Class is assigned to class attribute.

Thank you for your help. You are wonderful.

from collective-classification-weka-package.

fracpete avatar fracpete commented on September 22, 2024

That was a limitation of some of the algorithms, that an identical (unlabeled) instance has to be present. Please note, that no development on the algorithms has happened in the last 7 years, I only made the existing code available as Weka package. Please submit a new issue with this information. I may have time to look into that - however, this is a very low-priority bug issue for me.

from collective-classification-weka-package.

petrjirasek avatar petrjirasek commented on September 22, 2024

SOrry, I don't completely understand: What does it means? "That was a limitation of some of the algorithms, that an identical (unlabeled) instance has to be present."
It means that every instance in unlabeled set has to be in test set?

Thank you for your answer.

from collective-classification-weka-package.

fracpete avatar fracpete commented on September 22, 2024

Yes.

from collective-classification-weka-package.

petrjirasek avatar petrjirasek commented on September 22, 2024

Ok, thanks!

from collective-classification-weka-package.

Related Issues (5)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.