Create an arff file with the data types

Assignment Help Database Management System
Reference no: EM1373702

Google Weka and find the homepage for Weka.

When you install it, you may need to change your classpath to reference the jar file: weka-3-6\weka-src.jar

In the folder, you will find the icon to run Weka. Once you have it running.

Feel free to play around with the tool.

Part 2: Understand ARFF Format

Weka uses the ARFF (attribute-relation file format) file format.

Read through this document and familiarize yourself with the format.In addition, read chapters 1 and 2 of the text. Make sure you understand how sparse data is handled.You are expected to know this format well by next Tuesday!

Part 3: Problem

• create an arff file with the following data types
o flags, unit_id, names must be nominal
o timestamps (ts) must be date
o users and other ids must be numeric
o comments must be strings
• create an arff file that contains sparse data (eliminate the timestamps)
• test that these files can be loaded into Weka. You can load these via the Explorer or the Tools/ArffViewer.

There are arff convertor programs available online and Weka is able to read csv files and create arff files, but they won't create the files using the required data types. They can be used to get you started though.

Dataset 2

id,unit_id,name,created_ts,created_user,deactivated_ts,deactivated_user,active_flag,comments

1,ACC,Minor ACC,2/2/2001 22:00,21,NULL,NULL,1,

2,ACC,BA ACC-CMA,8/13/2001 6:34,21,NULL,NULL,1,

3,ACC,BS ACC-CMA,2/2/2001 7:15,21,2/2/2001 17:30,9,0,

8,MTH,BS Actuarial Science,10/12/2001 20:15,9,NULL,NULL,1,

11,MTH,BS Applied Mathematics,2/2/2001 12:00,9,8/13/2004 6:34,9,0,dropped

16,BIO,BA Biology,3/12/2001 19:34,21,NULL,NULL,1,

17,BIO,BS Biology,2/7/2001 12:00,13,2/21/2001 12:45,9,0,renamed

30,CSC,BA Computer Science,2/21/2001 12:00,21,NULL,NULL,1,

31,CSC,BS Computer Science,2/2/2001 8:43,9,NULL,NULL,1,

Reference no: EM1373702

Questions Cloud

Fundamentals of economic analysis : GRAND RAPIDS, Mich. - Kellogg Company on Monday said its earning growth 17.3% in the 2nd quarter on strong firm wide sales growth, beating Wall's Street's expectations.
Balance between short range and long range goals : General Electric is a large, highly decentralized company. At present it developed these goals, GE had approximately 170 responsibility centers called "departments,"
Acceleration of economic development : How would you account for the great divergence that is acceleration of economic development in the West in 19th century while much of the rest of world remained characterized through low rates of economic growth?
Multiple choice questions - macroeconomics : Assume a society manufacture only guns and butter. When it uses all its  resources for the production of guns and operates efficiently, it can manufacture  240 guns a year.
Create an arff file with the data types : Create an arff file with the following data types, flags, unit_id, names must be nominal and timestamps (ts) must be date
Calculating rate of growth in fuel costs : It costs $2600 to insulate a factory. Next year, the fuel savings will be $220. Every year after this, the expense of fuel is expected to increase by the rate g.
Multiple choice questions about t bills supply : Determine which of the following is not a major component of the Federal Reserve System? Kudrow stock just paid a dividend of $4.76 a share and plans to pay a dividend of $5 a share next year, which is expected to increase three 3% per year subsequen..
Maintaining economic growth through control inflation : In his semi yearly testimony to the Senate banking committee past summer Alan Greenspan commented on the recent Fed funds rate hike in late June 2004;
Adopt an ingredient branding strategy : Is there any way for Boeing to adopt an ingredient branding strategy with their jets and how - What would be the pros and cons?

Reviews

Write a Review

Database Management System Questions & Answers

  Complete information-level design for set of requirements

A database at a college is required to support the following requirements. Complete the information-level design for this set of requirements. Determine any constraints you need that are not stated in the problem.

  Explain what information is available in relational database

Explain what information is available from relational database containing one relation with attributes Name, Employee identification number, and Address which is not available.

  Prepare a database using microsoft access

Using Microsoft Access, prepare a database and save it as Acme Inc. Prepare the following tables: Employees and Products. Field names for Employees table are first name,

  Design tables in 3nf various codes for at least three fields

Create tables in 3NF. As you create the database, include different codes for at least three of the fields. Use sample data to populate fields for at least three records in each table.

  Same name to attributes which are in different tables

What about giving same name to attributes which are in different tables but are not same? For instance, "Description" in both a Course table and a Classroom table.

  Characteristics of database

Describe the database and describe the four characteristics of the database? Explain the Relational Database and generate a relational database for five employees.

  How to use traditional database design method

Explain how you would follow three phases of traditional database design method (Hierarchical, Network, and Relational), considering the following scenario.

  Explain the datawarehouse and data mining concepts

There are six major types of information systems which organisations use in their operations. Discuss how these information systems support managers in their decision making role Explain the datawarehouse and data mining concepts using appropria..

  Ways of implementing one-to-one relationships

Describe the difference ways of implementing one-to-one relationships. Assume you are maintaining information on offices (office numbers, building, and phone numbers)

  Computing functional dependencies

Compute the functional dependencies which exist in following table. After determining the functional dependencies, transform this table to an equivalent collection of the tables which are in third normal form.

  Create a database design - relational data model

Create a database design specification Enhanced Entity Relationship Diagram (EERD) and Relational Data Model from the given business description - Database Management System

  Essay on data mining in warehouse architectures

Course: data mining. Require a 7 page essay on subjects: Warehouse Architectures: the paper requires to contain information about centralized, federated, and tiered data warehouse.

Free Assignment Quote

Assured A++ Grade

Get guaranteed satisfaction & time on delivery in every assignment order you paid with us! We ensure premium quality solution document along with free turntin report!

All rights reserved! Copyrights ©2019-2020 ExpertsMind IT Educational Pvt Ltd