美文网首页
讲解:IT270、R、R、datasetsDatabase|R

讲解:IT270、R、R、datasetsDatabase|R

作者: kuijinzhong | 来源:发表于2020-01-12 11:45 被阅读0次

Final #1 (ID:162 - AKA THANOS)IT270ALEXANDER PELAEZInstructions.Please make sure if you use R you copy and paste it into Word using Courier Font (makes it easier to Read). For each of the problems that are looking for a response (not just a calculation), be sure to explain and interpret the results. If you aren’t sure…ASK PLEASE.Please start each question on a new page and clearly label that start of each problem (Maybeslightly larger font, bold face , underline… ) anything that will help me find the problem you areworking on.As a final exam you are to work on your problems individually. However, you may discusstechniques and approaches. You may not copy code or answers and copying other studentscode and answers could result in major penalty or even failure on the exam or the class.1. People Analytics (GoT...Analytics)The CEO is concerned regarding recent events at the company, and wants to initiate a newprogram. It is concerned with gender discrimination and equitability across the board. They areseeking to also increase overall job satisfaction.Using the employee datasets (from the midterm), provide a determination if there is genderdiscrimination in the workplace, and provide an assessment of factors that should be looked atfor job satisfaction and how these factors can be used. This should be a very comprehensivereview.A few notes:You must come up with appropriate methodologies and applications of those methodologies.Your final answer should be in the form of text with supporting facts (maybe from your analysisin R). Expect this to be read as a report by the CEO. At the end of your answer you can provideall necessary R code and outputs.Any points you raise must be justified. If you try something let’s say you wanted to determine ifemployees that have dogs instead of cats are important. You should list it and try and say itdidn’t work...make sure it was relevant in the first place. You may write even something as smallas a line or two on that. If something is important...write about it.You may ask me any question you like on this and I will answer it as best as I can. If you ask,how many methods do I need… answer from CEO ( I don’t know you’re the expert...that’s what Ipay you for). You may use techniques from BAN203 or BAN250, but they cannot be the core ofthe paper, they can only be used as support or justification.2. Handwriting Recognition: ENDGAME (Continuation of HW3)Your colleague was called into the bosses office and was reprimanded. Your colleague arguedthat your prior analysis is wrong and you didn’t explain anything about the model.a) Why was your colleague wrong in the first place?b) Your colleague reran the code and claims that a one layer ,40 hidden node modelactually works best, since there are ten numbers and each of the nodes gives a 25%probability of hiIT270留学生作业代做、代写R程序设计作业、代做R编程语言作业调试、datasets作业代做 代做Database|代tting a number from 0-10, and thus each hidden node can be explainedclearly. Explain why this is a good rationale or notc) Re run your neural network, however, this time you have a training set of 60,000 and thetest set of 10,000 (this may take some time to run).i) Provide a table and graph of the error rates in your training set and test based onthe number of nodes chosen in your layers. Since there may be quite a numberof combinations, be effective in how you approach this.ii) Provide your final answer, with the error rate and time it takes to complete theanalysis.d) Consider all the techniques we learned in classi) Can any other technique be used to perform the classification. If so list thetechniques with a one-line rationale as to whyii) Run the techniques in part 1, and provide the final answer for that techniquealong with the training error rate and test rate (be sure to indicate any other trialsyou had).e) Consider everything above, write a summarized conclusion of everything above youwould hand to a Chief Analytics Officer.f) After this, what should happen to your colleague?3. Performance Optimization (Return of the FIFA)The FIFA Director of Analytics isn’t completely convinced with your prior analysis. The belief isthat your analysis isn’t useful. Use the original fifa.csv dataset.a) Explain how your prior FIFA analysis was useful and how it can be used.Ultimately there needs to be a unifying factor across all techniques used. If the primaryquestion is how to build a team then your job will be to develop a methodology that willassist in this, therefore,b) How would you group players ? Be sure to establish what fields are necessary prior togrouping and indicate how you grouped them. Then describe the groups.c) Using the groups can you assess the clubs. Explain if this is useful.d) How can your initial analysis (from Homework 2) be integrated with b and/or c?e) Can you find any other useful and interesting analytics (from our techniques) that wouldbe helpful to the Director of Analytics.6. Theoretical Problemsa) Michele Piccinno a researcher in Artificial Neural Networks stated (2016), “NeuralNetworks are the second best way to solve a problem. The best way is to understand theproblem”. Provide an opinion based on your knowledge of whether you agree ordisagree with this statement?b) Explain how accuracy is measured across the techniques and why is it appropriate forthose techniques?c) What are some of the challenges with data when starting a project and what would yourinitial steps be?d) Explain how a decision tree can be used to predict a continuous variable. How are theresults interpreted?e) Provide an example of an ethical problem with data mining, and what an analyst shouldconsider and how they should mitigate the problem, if possible.转自:http://www.7daixie.com/2019050556937603.html

相关文章

网友评论

      本文标题:讲解:IT270、R、R、datasetsDatabase|R

      本文链接:https://www.haomeiwen.com/subject/onmiactx.html