<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="ru">
	<id>https://wikicshse.ru/index.php?action=history&amp;feed=atom&amp;title=Data_Science_for_Business_2020</id>
	<title>Data Science for Business 2020 - История изменений</title>
	<link rel="self" type="application/atom+xml" href="https://wikicshse.ru/index.php?action=history&amp;feed=atom&amp;title=Data_Science_for_Business_2020"/>
	<link rel="alternate" type="text/html" href="https://wikicshse.ru/index.php?title=Data_Science_for_Business_2020&amp;action=history"/>
	<updated>2026-06-07T21:16:32Z</updated>
	<subtitle>История изменений этой страницы в вики</subtitle>
	<generator>MediaWiki 1.45.3</generator>
	<entry>
		<id>https://wikicshse.ru/index.php?title=Data_Science_for_Business_2020&amp;diff=182&amp;oldid=prev</id>
		<title>imported&gt;Kurmukovai: /* Seminar&#039;s materials */</title>
		<link rel="alternate" type="text/html" href="https://wikicshse.ru/index.php?title=Data_Science_for_Business_2020&amp;diff=182&amp;oldid=prev"/>
		<updated>2020-12-09T14:14:15Z</updated>

		<summary type="html">&lt;p&gt;&lt;span class=&quot;autocomment&quot;&gt;Seminar&amp;#039;s materials&lt;/span&gt;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;Новая страница&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&lt;br /&gt;
&lt;br /&gt;
== About the Course ==&lt;br /&gt;
&lt;br /&gt;
Data Science for Business. MAGoLEGO course.&lt;br /&gt;
&lt;br /&gt;
Spring 2020. Module 4.&lt;br /&gt;
&lt;br /&gt;
Department of Data Analysis and Artificial Intelligence, School of Computer Science.&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;span style=&amp;quot;color:#DC143C&amp;quot;&amp;gt;Join our telegram channel &amp;lt;/span&amp;gt; [https://t.me/joinchat/ENzQEhr-hra2WhEjxvgayw Data science for business.]&lt;br /&gt;
&lt;br /&gt;
===Instructors===&lt;br /&gt;
&lt;br /&gt;
[https://www.hse.ru/staff/lzhukov Prof. Leonid Zhukov] &lt;br /&gt;
&lt;br /&gt;
[https://www.hse.ru/staff/iamakarov Ilya Makarov]&lt;br /&gt;
&lt;br /&gt;
[https://www.hse.ru/staff/intergalactic_admiral/ Anvar Kurmukov]&lt;br /&gt;
&lt;br /&gt;
===Links===&lt;br /&gt;
* Alternative Course website [http://www.leonidzhukov.net/hse/2020/datascience/]&lt;br /&gt;
* Lectures link https://zoom.us/j/7723819319 Fridays, 6.10pm - 7.30pm&lt;br /&gt;
* Seminars link https://zoom.us/j/636910206 Fridays, 7.40pm - 9.00pm&lt;br /&gt;
&lt;br /&gt;
===Course outline===&lt;br /&gt;
&lt;br /&gt;
* Introduction to data science&lt;br /&gt;
* Data mining, statistics, machine learning, optimization&lt;br /&gt;
* Case studies&lt;br /&gt;
* Increasing business impact&lt;br /&gt;
&lt;br /&gt;
===Content===&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
|-&lt;br /&gt;
! № !! Date !! Title !! Abstract&lt;br /&gt;
|-&lt;br /&gt;
| 1 || 10.04.2020 || Introduction to data science. || Introduction to data science and its role in industry. Examples of real world use cases.  &lt;br /&gt;
|-&lt;br /&gt;
| 2 ||17.04.2020  || Working with data.            ||   Data cleaning and preparation. ETL process. Basic data analysis and visualization.               &lt;br /&gt;
|-&lt;br /&gt;
| 3 ||  24.04.2020          ||  Data mining, machine learning, statistics || Types of ML algorithms, applicability, training and testing, solution quality.   &lt;br /&gt;
|-&lt;br /&gt;
| 4 ||   15.05.2020         || Case study 1: Pricing||   The goal of the case is to compute price elasticity. &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Algorithms&amp;#039;&amp;#039;&amp;#039;: Supervised learning: linear and non-linear regression, predicting continuous variable. Dimensionality reduction: PCA.        &lt;br /&gt;
|-&lt;br /&gt;
| 5 ||    22.05.2020        ||Case study 2: Churn modeling||The goal of the case is to predict which customers are going to leave the service within a given time. &lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Algorithms&amp;#039;&amp;#039;&amp;#039;: Supervised learning. Classification: Logistic regression, Decision trees, Random forest.&lt;br /&gt;
|-&lt;br /&gt;
| 6 || 29.05.2020   ||Case study 3: Customer segmentation ||The goal of the case is to group customers into clusters based on some customer similarity metrics.&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Algorithms&amp;#039;&amp;#039;&amp;#039;: clustering – k-means, agglomerative, dimensionality reduction - PCA.&lt;br /&gt;
|-&lt;br /&gt;
| 7 || 05.06.2020||Case study 4: Personalizaton ||The goal of the case is to build a recommender system.&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Algorithms&amp;#039;&amp;#039;&amp;#039;: association rules and collaborative filtering.&lt;br /&gt;
|-&lt;br /&gt;
| 8 || 12.06.2020 ||Impacting the business ||How to create a visible impact on business with analytics&lt;br /&gt;
|-&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
===Seminar&amp;#039;s materials===&lt;br /&gt;
&lt;br /&gt;
[https://yadi.sk/i/eJm4z1dwLOAXhw Seminar 1. video. ]&lt;br /&gt;
&lt;br /&gt;
[https://yadi.sk/d/aAYt0omgokKuRg Seminar 2. video.], RM processes [https://yadi.sk/d/ij1bgNTd7VFJQQ COVID regression], [https://yadi.sk/d/XvLpO7frlD_l-A COVID], [https://yadi.sk/d/NhmsEpnWYQgshg Fisher&amp;#039;s Iris], [https://yadi.sk/d/RO3RX1PPP2XoSw COVID days since 50 confirmed cases], [https://yadi.sk/d/TdmY-qZt_Uu-pQ Iris depivot example]&lt;br /&gt;
&lt;br /&gt;
[https://yadi.sk/i/l2LhZSooi8RkEA Seminar 3. video.] [https://yadi.sk/d/BJw8HSYJMoC5qQ Handling categorical values]. [https://yadi.sk/d/AmL3HXa-fuxS9g Handling missing values]. [https://yadi.sk/d/STx2V0-rgYlybA Titanic prediction on train-test setting.]&lt;br /&gt;
&lt;br /&gt;
[https://youtu.be/lsK5rK7-WvI Seminar 4. video] RM processes [https://yadi.sk/d/BVRGT0fqpwdUTQ Walmart preprocessing] [https://yadi.sk/d/nw76wzn1x7e1VQ Walmart regression] [https://yadi.sk/d/ig5j_tHesiRg0A GridSearch] [https://docs.google.com/document/d/1Ptj7J1ikOVsuGmY5rPNFCo0p3hrDcLyQAwotDxxbBiQ/edit?usp=sharing Seminar plan]&lt;br /&gt;
&lt;br /&gt;
[https://youtu.be/mcaic7sgz3M Seminar 5. video.] [https://docs.google.com/presentation/d/1uCC1xNon8OpWg3jzRSXqaM8xpV11DFdGl8bWsUVJ-ho/edit?usp=sharing Seminar presentation.][https://yadi.sk/d/E4ToVUBnC4dong Telecom Churn data] [https://yadi.sk/d/jdv5incZra30Fw RM process]&lt;br /&gt;
&lt;br /&gt;
[https://youtu.be/vZYSp36_0o4 Seminar 6. video.] [https://docs.google.com/document/d/1Ptj7J1ikOVsuGmY5rPNFCo0p3hrDcLyQAwotDxxbBiQ/edit?usp=sharing Plan for the Seminar 6]  [https://yadi.sk/d/rl-7BVlMrhKVkA small_mnist] [https://yadi.sk/d/JPPZfxE1A0SWkg transactions]; RM processes: [https://yadi.sk/d/H1uzIN09w3FDfg digits_clustering_dr] [https://yadi.sk/d/EXrPElNnv4SMWA stores_clustering]&lt;br /&gt;
&lt;br /&gt;
[https://youtu.be/YOVGgGQq0rM Seminar 7. video.] [https://yadi.sk/d/IbzLqpgD0wLccw RM process]&lt;br /&gt;
&lt;br /&gt;
===Lecture&amp;#039;s materials===&lt;br /&gt;
&lt;br /&gt;
[https://www.youtube.com/playlist?list=PLriUvS7IljvlcLnrvYUyNc9nXhiM9kWjq Lecture&amp;#039;s youtube playlist]&lt;br /&gt;
&lt;br /&gt;
===Home assignments===&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;span style=&amp;quot;color:#228B22&amp;quot;&amp;gt; Google doc with [https://docs.google.com/document/d/1Sk-nr5owlKf8MYgdmItit47WLBvWVzMCyc3G_sRLxfs/edit?usp=sharing Q&amp;amp;A] about Home Assignment tasks (contributed by students).&amp;lt;/span&amp;gt;&lt;br /&gt;
&lt;br /&gt;
[https://docs.google.com/spreadsheets/d/1jPHcaTL0CeeaF79VvNqFF3-RvS2XGERNY2dlWmV5aUs/edit#gid=1218360773 Google doc] with grades.&lt;br /&gt;
&lt;br /&gt;
1. Analyze COVID dataset &lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;span style=&amp;quot;color:#DC143C&amp;quot;&amp;gt; due to Monday, April 27, &amp;lt;s&amp;gt;8 am&amp;lt;/s&amp;gt; 23:59 Moscow time. &amp;lt;/span&amp;gt;&lt;br /&gt;
* Home assignment [https://docs.google.com/document/d/1WMIyh9opbYrK7cWfwHiTcnhOud3l--Sh2VPeVF7QUbc/edit?usp=sharing description].&lt;br /&gt;
* Starter [https://yadi.sk/d/jWZDCPhOq_CbuA process.]&lt;br /&gt;
* [https://yadi.sk/d/SST3aJE6e4nSzQ Total Cases], [https://yadi.sk/d/pFx64HkmX93UWw Deaths], [https://yadi.sk/d/2cixg4bpOfKbMQ Recovered] on April 19, 2020.&lt;br /&gt;
* [https://docs.google.com/forms/d/e/1FAIpQLScCQkT4XB9z74r2mZMttY9IFqlQe0RTVgIU8pjj-u3bUBwGig/viewform?usp=sf_link Google form] to submit your solution.&lt;br /&gt;
* [https://yadi.sk/d/VwfTrIUMltfabA HA 1. Solution]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
2. Analyze Titanic dataset&lt;br /&gt;
&amp;lt;s&amp;gt;&amp;lt;br&amp;gt;&amp;lt;span style=&amp;quot;color:#DC143C&amp;quot;&amp;gt; due to Friday, May 8, 8 am Moscow time. &amp;lt;/span&amp;gt; (there will be no extensions)&amp;lt;/s&amp;gt; May 11, 8am, Moscow time.&lt;br /&gt;
* Home assignment [https://docs.google.com/document/d/1WMIyh9opbYrK7cWfwHiTcnhOud3l--Sh2VPeVF7QUbc/edit?usp=sharing description], starting with page 3.&lt;br /&gt;
* [https://yadi.sk/d/1d6rarln1ybqNQ Titanic dataset]&lt;br /&gt;
* [https://docs.google.com/forms/d/e/1FAIpQLSecCpGrn6e_j5KS8rRoKCMKy0Zy0f2wIaiMFxwHRWecxHH1Nw/viewform?usp=sf_link Google form] to submit your solution.&lt;br /&gt;
* [https://yadi.sk/d/5aCD_7VGo8GUdw HA 2. Solution]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
3. Analyze Walmart Sales dataset&lt;br /&gt;
&amp;lt;s&amp;gt;&amp;lt;br&amp;gt;&amp;lt;span style=&amp;quot;color:#DC143C&amp;quot;&amp;gt; due to Friday, May 25, 8 am Moscow time. &amp;lt;/span&amp;gt; (there will be no extensions)&amp;lt;/s&amp;gt; May 27, 23:59 Moscow time.&lt;br /&gt;
* Home assignment [https://docs.google.com/document/d/1WMIyh9opbYrK7cWfwHiTcnhOud3l--Sh2VPeVF7QUbc/edit?usp=sharing description], starting with page 6.&lt;br /&gt;
* [https://www.kaggle.com/c/walmart-recruiting-store-sales-forecasting/data Walmart data]&lt;br /&gt;
* [https://docs.google.com/forms/d/e/1FAIpQLSfgZl2UCwr8oDxfaeinV0xjlYCGjQ3OQWdbOeWxCHLQwP9_sA/viewform?usp=sf_link Google form] to submit your solution.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
4. Predict customers churn&lt;br /&gt;
&lt;br /&gt;
&amp;lt;s&amp;gt;&amp;lt;span style=&amp;quot;color:#DC143C&amp;quot;&amp;gt; due to Friday, June 5, 23:59 Moscow time. &amp;lt;/span&amp;gt; (there will be no extensions)&amp;lt;/s&amp;gt; Monday, June 8, 23:59 Moscow&lt;br /&gt;
* Home assignment [https://docs.google.com/document/d/1WMIyh9opbYrK7cWfwHiTcnhOud3l--Sh2VPeVF7QUbc/edit?usp=sharing description], starting with page 14.&lt;br /&gt;
* [https://yadi.sk/d/Fhs7pElrkdhW-w data] for the assignment.&lt;br /&gt;
* [https://docs.google.com/forms/d/e/1FAIpQLSfwilwkleJSqAB1F2OMPDgyGBQmC3Z4su9IdmJbI9NGNzyxGA/viewform?usp=sf_link Google form] to submit your solution.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
5. Cluster items and build a recommender system.&lt;br /&gt;
&lt;br /&gt;
&amp;lt;span style=&amp;quot;color:#DC143C&amp;quot;&amp;gt; due to Wednesday, June 17, 23:59 Moscow time. &amp;lt;/span&amp;gt; &lt;br /&gt;
* Home assignment [https://docs.google.com/document/d/1WMIyh9opbYrK7cWfwHiTcnhOud3l--Sh2VPeVF7QUbc/edit?usp=sharing description], starting with page 19.&lt;br /&gt;
* [https://docs.google.com/forms/d/e/1FAIpQLSfnHymipXfjTTnrvat9vDOs5L6tQtA8WJY4JQ-7DTCY7pexyQ/viewform?usp=sf_link Google form] to submit your solution will be posted later.&lt;br /&gt;
&lt;br /&gt;
===Textbooks===&lt;br /&gt;
&lt;br /&gt;
*Provost, Foster, Fawcett, Tom. Data Science for Business: What you need to know about data mining and data-analytic thinking. O&amp;#039;Reilly Media, Inc.&amp;quot;, 2013.&lt;br /&gt;
*James, G. et al. An introduction to statistical learning. Springer, 2013.&lt;br /&gt;
*Siegel, E. Predictive analytics: The power to predict who will click, buy, lie, or die. John Wiley &amp;amp; Sons, 2016.&lt;br /&gt;
&lt;br /&gt;
===Software===&lt;br /&gt;
&lt;br /&gt;
*For online lectures and seminars. [https://zoom.us/ zoom]&lt;br /&gt;
*Modelling package. [https://rapidminer.com/ RapidMiner]&lt;br /&gt;
&amp;lt;br&amp;gt;&amp;lt;span style=&amp;quot;color:#DC143C&amp;quot;&amp;gt; Apply for educational version https://rapidminer.com/get-started-educational/ &amp;lt;/span&amp;gt; &lt;br /&gt;
&lt;br /&gt;
*Email: Enter your university email (end with @edu.hse.ru)&lt;br /&gt;
*Job Function: Student&lt;br /&gt;
*University: Higher School of Economics&lt;br /&gt;
*Course Name: Data Science for Business&lt;br /&gt;
*Course Number: https://www.hse.ru/edu/courses/341840822&lt;br /&gt;
*Course Term: Summer Term&lt;br /&gt;
*Professor: Leonid Zhukov&lt;/div&gt;</summary>
		<author><name>imported&gt;Kurmukovai</name></author>
	</entry>
</feed>