Wednesday, May 6, 2020

Assgn - 2175 Words

Assignment 1: Using the WEKA Workbench A. Become familiar with the use of the WEKA workbench to invoke several different machine learning schemes. Use latest stable version. Use both the graphical interface (Explorer) and command line interface (CLI). See Weka home page for Weka documentation. B. Use the following learning schemes, with the default settings to analyze the weather data (in weather.arff). For test options, first choose Use training set, then choose Percentage Split using default 66% percentage split. Report model percent error rate. ZeroR (majority class) OneR Naive Bayes Simple J4.8 C. Which of these classifiers are you more likely to trust when determining whether to play? Why? D. What can you say about†¦show more content†¦1,2,..38) and an Affymetix call (P is gene is present, A if absent, M if marginal). Think of the training data as a very tall and narrow table with 7130 rows and 78 columns. Note that it is sideways from machine learning point of view. That is the attributes (genes) are in rows, and observations (samples) are in columns. This is the standard format for microarray data, but to use with machine learning algorithms like WEKA, we will need to do matrix transpose (flip) the matrix to make files with genes in columns and samples in rows. We will do that in step 3B.6 of this assignment. Here is a small extract Gene Description Gene Accession Number 1 call 2 call ... GB DEF = GABAa receptor alpha-3 subunit A28102_at 151 A 263 P ... ... AB000114_at 72 A 21 A ... ... AB000115_at 281 A 250 P ... ... AB000220_at 36 A 43 A ... 3B: Clean the data Perform the following cleaning steps on both the train and test sets. Use unix tools, scripts or other tools for each task. Document all the steps and create intermediate files for each step. After each step, report the number of fields and records in train and test files. (Hint: Use unix command wc to find the number of records and use awk or gawk to find the number of fields). Microarray Data Cleaning Steps 1. Remove the initial records with Gene Description containing control. (Those are Affymetrix controls, not human genes). Call the resulting files ALL_AML_grow.train.noaffy.tmp andShow MoreRelatedBUS210 Assgn 1 Essay2094 Words   |  9 Pagesï » ¿Tiffany Simpson April 13, 2015 BUS 210 Assignment 1 Case Study 1 2 Instructor Divya Kashyap t.simpson3@students.clark.edu Case 1 Amazon.com 1: Toys R Us sales exceeded $300 Million by 2004 on the Amazon.com site. In about 200 words explain how Amazon, Toys R Us, and other toy sellers who participated in Amazons Marketplace retailer program benefited from the network effect as a result of the relationship between Amazon and Toys R Us. Toys R Us and other toy sellers who participatedRead Morelegally astute manager leg100 assgn 11660 Words   |  7 Pagesï » ¿ In my opinion any marketing manager who is not utilizing social media as a marketing tool is failing to utilize an extremely cost effective means of reaching their target market. There are a wide array of sites that an shrewd manager can utilize to market their products, sites such as Facebook, Yahoo, Google, Bing, My space, YouTube and many others. I believe that the site that the marketing manager chooses would depend on the type of customer that they are attempting to target. Actually

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.