Generating noisy monotone ordinal datasets

Irena Milstein, Arie Ben David, Rob Potharst

Abstract


Ordinal decision problems are very common inreal-life. As a result, ordinal classification models have drawn much attentionin recent years. Many ordinal problem domains assume that the output is monotonouslyrelated to the input, and some ordinal data mining models ensure this propertywhile classifying. However, no one has ever reported how accurate these modelsare in presence of varying levels of non-monotone noise. In order to do thatresearchers need an-easy-to-use tool for generating artificial ordinal datasetswhich contain both an arbitrary monotone pattern as well as user-specifiedlevels of non-monotone noise. An algorithm that generates such datasets is presentedhere in detail for the first time. Two versions of the algorithm are discussed.The first is more time consuming. It generates purely monotone datasets as thebase of the computation. Later, non-monotone noise is incrementally inserted tothe dataset. The second version is basically similar, but it is significantlyfaster.  It begins with the generation ofalmost monotone datasets before introducing the noise. Theoretical andempirical studies of the two versions are provided, showing that the second,faster, algorithm is sufficient for almost all practical applications. Some usefulinformation about the two algorithms and suggestions for further research arealso discussed.

Full Text: PDF DOI: 10.5430/air.v3n1p30

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution 3.0 License.

Artificial Intelligence Research

ISSN 1927-6974 (Print)   ISSN 1927-6982 (Online)

Copyright © Sciedu Press 
To make sure that you can receive messages from us, please add the 'Sciedu.ca' domain to your e-mail 'safe list'. If you do not receive e-mail in your 'inbox', check your 'bulk mail' or 'junk mail' folders.