![]() ![]() ![]() Values that contain spaces must be quoted. Fisher % (b) Donor: Michael Marshall % (c) Date: July, 1988 % iris sepallength NUMERIC sepalwidth NUMERIC petallength NUMERIC petalwidth NUMERIC class An example header on the standard IRIS dataset looks like this: The Header of the ARFF file contains the name of the relation, a list of the attributes (the columns in the data), and their types. The first section is the Header information, which is followed the Data information. The same file in ARFF format looks as follows: sepallength sepalwidth petallength petalwidth class Iris-setosaĪRFF files have two distinct sections. It is an extension of the CSV file format where a header is used that provides metadata about the data types in the columns.įor example, the first few lines of the classic iris flowers dataset in CSV format looks as follows: Weka prefers to load data in the ARFF format.ĪRFF is an acronym that stands for Attribute-Relation File Format. Stringfor lists of words, like this sentence.Nominal for categorical data like “dog” and “cat”.Integer for numeric values without a fractional part like 5.Attribute: A column of data is called a feature or attribute, as in feature of the observation.Įach attribute can have a different type, for example:.Instance: A row of data is called an instance, as in an instance or observation from the problem domain.Weka has a specific computer science centric vocabulary when describing data: This is called tabular or structured data because it is how data looks in a spreadsheet, comprised of rows and columns. Machine learning algorithms are primarily designed to work with arrays of numbers. ![]()
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. Archives
December 2022
Categories |