SUBMISSION REQUIREMENTS: Submit ONLY a single R script file named “First_Last Name.R”. Your R script must calculate the effectiveness of your classification as described below.
Similar to the classification example, process and classify the newsgroup document data. Download this data and save it on your computer in your R packages folder under “tm/text/”. Your code MUST access it from there!
Note that the data is separated into one test and one train folder, each containing 20 subfolders on different subjects. Analyze these 2 subjects (sci.space and rec.autos), using 100 documents from each.
Consider “rec.autos” the positive event and “sci.space” the negative event. Note that the kNN syntax expects Positive first, Negative second.
Classify the Newsgroups data (the “by date” version of the data set) from Blackboard:
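The Positive-first convention interacts with how R builds factors: factor() sorts levels alphabetically by default, which would place “Negative” ahead of “Positive”. The sketch below is a minimal base R illustration, not the required solution. The names train.rows, test.rows and Tags are assumptions chosen for readability, and the row ranges assume the merged 400-document order described in the steps that follow (sci.space rows first, rec.autos rows second).

```r
# Assumed merged order: rows 1:200 are sci.space (train, then test),
# rows 201:400 are rec.autos (train, then test).
# Listing the rec.autos rows first puts the positive event first.
train.rows <- c(201:300, 1:100)
test.rows  <- c(301:400, 101:200)

# Explicit levels keep "Positive" first; without them the levels
# would be sorted alphabetically ("Negative", "Positive").
Tags <- factor(rep(c("Positive", "Negative"), each = 100),
               levels = c("Positive", "Negative"))

print(table(Tags))
```

Setting the levels explicitly is one way to get the expected table(Tags) layout; reordering an existing factor with relevel() would be another.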
•        Save data in your “tm/text/” folder so you can specify path using system.file()
•        For each subject select:
–       100 documents for training from the train folder
–       100 documents for testing from the test folder
•        Obtain the merged Corpus (of 400 documents), keeping the order as:
–       Doc1.Train from the “sci.space” newsgroup train data
–       Doc1.Test from the “sci.space” newsgroup test data
–       Doc2.Train from the “rec.autos” newsgroup train data
–       Doc2.Test from the “rec.autos” newsgroup test data
•        Implement preprocessing (clearly indicate what you have used)
•        Create the Document-Term Matrix using the following arguments (word lengths of at least 2; terms appearing in at least 5 documents)
–      use: control=list(wordLengths=c(2,Inf), bounds=list(global=c(5,Inf)))
•        Split the Document-Term Matrix into proper test/train row ranges
–       train range containing rows (1:100) and (201:300)
–       test range  containing rows (101:200) and (301:400)
–       Note that knn expects the positive (“Rec”) event first, so re-adjust your train/test range if necessary.
•        Use the abbreviations “Positive” and “Negative” as tag factors in your classification.
–       Check if the tag order is correct using table(Tags)
–       You should get:
        Tags
        Positive Negative
             100      100
–       If your order is not right make proper changes.
•        Classify the text using the knn() function
•        Display the classification results as an R data frame and name the columns:
–       “Doc”
–       “Predict” – Tag factors of predicted subject (Positive or Negative)
–       “Prob” – The classification probability
–       “Correct” – TRUE/FALSE
•        What is the percentage of correct (TRUE) classifications?
•        Estimate the effectiveness of your classification:
–       Calculate and clearly mark the values TP, TN, FP, FN
–       Create the confusion matrix and label the rows and columns to show which is the Positive and which is the Negative event
–       Calculate Precision
–       Calculate Recall
–       Calculate F-score
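The classification and evaluation steps above can be sketched on toy data. This is a hedged illustration only: the two-column matrices are hypothetical stand-ins for Document-Term Matrix rows, the numbers are not newsgroup results, and the variable names (train, test, Actual, result, and so on) are assumptions. It uses knn() from the class package with prob = TRUE, which attaches the share of the k votes won by the predicted tag as the "prob" attribute.

```r
library(class)  # provides knn(); assumed to be installed

# Hypothetical stand-ins for DTM rows (two term counts per document).
train <- rbind(c(5, 0), c(6, 1), c(5, 1),            # positive-like documents
               c(0, 5), c(1, 6), c(1, 5))            # negative-like documents
test  <- rbind(c(6, 0), c(5, 2), c(6, 2), c(0, 6),   # true tag: Positive
               c(2, 5), c(1, 4), c(5, 1), c(6, 1))   # true tag: Negative

lv     <- c("Positive", "Negative")
Tags   <- factor(rep(lv, each = 3), levels = lv)     # training tags
Actual <- factor(rep(lv, each = 4), levels = lv)     # true test tags

fit <- knn(train, test, cl = Tags, k = 3, prob = TRUE)

result <- data.frame(
  Doc     = seq_len(nrow(test)),
  Predict = factor(as.character(fit), levels = lv),
  Prob    = attr(fit, "prob"),
  Correct = as.character(fit) == as.character(Actual)
)
pct.correct <- 100 * mean(result$Correct)

# Counts for the confusion matrix, with "Positive" as the event of interest.
TP <- sum(result$Predict == "Positive" & Actual == "Positive")
TN <- sum(result$Predict == "Negative" & Actual == "Negative")
FP <- sum(result$Predict == "Positive" & Actual == "Negative")
FN <- sum(result$Predict == "Negative" & Actual == "Positive")

confusion <- matrix(c(TP, FN, FP, TN), nrow = 2, byrow = TRUE,
                    dimnames = list(Actual = lv, Predicted = lv))

Precision <- TP / (TP + FP)
Recall    <- TP / (TP + FN)
Fscore    <- 2 * Precision * Recall / (Precision + Recall)

print(result)
print(confusion)
```

In the real task the matrices would come from the train and test row ranges of the Document-Term Matrix, and the value of k is a choice left to the analyst; k = 3 here is arbitrary.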
Note that one way you can select only 100 documents is:
> Temp1 <- DirSource(Doc1.TrainPath)
> Doc1.Train <- Corpus(URISource(Temp1$filelist[1:100]), readerControl=list(reader=readPlain))
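The same first-100 selection can be sketched without tm, which may help when checking which files end up in the subset. The example below is an illustrative assumption rather than the course's method: it builds a throwaway folder of hypothetical empty files under tempdir() in place of a real newsgroup folder, lists them with base R's list.files(), and keeps the first 100 paths.

```r
# Hypothetical stand-in for one newsgroup folder: 120 empty files in a temp dir.
demo.dir <- file.path(tempdir(), "newsgroup_demo")
dir.create(demo.dir, showWarnings = FALSE)
invisible(file.create(file.path(demo.dir, sprintf("doc%03d", 1:120))))

# list.files() returns paths in alphabetical order; head() keeps the first 100.
all.files <- list.files(demo.dir, full.names = TRUE)
first100  <- head(all.files, 100)

print(length(first100))
```

Because the selection depends on the listing order, the same 100 files are picked on every run as long as the folder contents do not change.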