代写RP position、代做Python程序语言、代写Python、代做data留学生-代写Python编程

联系方式

QQ：99515681
邮箱：99515681@qq.com
工作时间：8:00-21:00
微信：codinghelp

您当前位置：首页 >> Python编程Python编程

代写RP position、代做Python程序语言、代写Python、代做data留学生

日期：2019-12-08 07:36

Coding test for RP position

Instructions

Please complete the two coding tasks below. The deliverable for each task should consist of (1) data files

(usually CSV or Excel), (2) the code you used to generate the data file. The code should include

annotations to help the reader understand what is being done. Both tasks must be completed using Python.

These tasks should be relatively straightforward; please take no longer than 12 hours to complete.

TASK 1

Using the Python Requests package make a call to the S&P 100 Wikipedia page. Using Python BeautifulSoup,

extract the name, symbol, and link of the firms composing the index.

Then, for each company, find the best match in the file “names gvkeys.CSV" using the Python package

“fuzzy-wuzzy".

The final dataset must have five columns: the company name, the symbol, the link to the company

Wikipedia page, the matched company name and the score assigned to the match.

Deliverables:

1. Export the result as either an Excel or CSV format

TASK 2

Your task is to analyze the 10-K forms for two companies, “Apple Inc." and “Intel”, for the fiscal years

2014 through 2018.

• These forms are available on the SEC EDGAR website.

• Please note that fiscal years are not necessarily the same as calendar years.

Retrieve these documents and complete the following two sections.

First part

Please determine and tabulate the following for each fiscal year:

1. the company name

2. the number of words in the filing's “Risk Factors" section

3. the number of times the word “competition" is mentioned in this section

Deliverables:

1. Export the result as either an Excel or CSV format

2. Export the text of the “Risk Factors" sections as separate txt files for each company-year

Second part

Using the nltk Python package, remove all the stop words from the “Risk factors" section and, using

regular expressions, identify all the occurrences of the word “patent" or its variations (“patents”,

“patenting”, etc.). Please also retrieve the word preceding and the word following the occurrence found.

In other words, you will have to construct a dataset with the “word preceding", the “occurrence found",

and the “word following".

Deliverables:

1. Export the result as either an Excel or CSV format.

【返回顶部】【打印本稿】【关闭本页】

【上一篇】：代写ECON215留学生、代写Python设计、Python编程语言调试

【下一篇】：代写ECON215留学生、代写Python设计、Python编程语言调试

联系方式

最新辅导

热门辅导

您当前位置：首页 >> Python编程Python编程

代写RP position、代做Python程序语言、代写Python、代做data留学生

日期：2019-12-08 07:36

相关文章