-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathProject3.html
More file actions
132 lines (110 loc) · 5.63 KB
/
Project3.html
File metadata and controls
132 lines (110 loc) · 5.63 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
<!DOCTYPE HTML>
<!--
Massively by HTML5 UP
html5up.net | @ajlkn
Free for personal and commercial use under the CCA 3.0 license (html5up.net/license)
-->
<html>
<head>
<title>Vadim Makeev - Local Scraper</title>
<meta charset="utf-8" />
<meta name="viewport" content="width=device-width, initial-scale=1, user-scalable=no" />
<link rel="stylesheet" href="assets/css/main.css" />
<noscript><link rel="stylesheet" href="assets/css/noscript.css" /></noscript>
<link href="//netdna.bootstrapcdn.com/font-awesome/3.2.1/css/font-awesome.css" rel="stylesheet">
</head>
<body class="is-preload">
<!-- Wrapper -->
<div id="wrapper">
<!-- Header -->
<header id="header">
<!-- <a href="index.html" class="logo">Excel Project, Israeli Knesset</a> -->
</header>
<!-- Nav -->
<nav id="nav">
<ul class="links">
<li><a href="index.html">Projects</a></li>
<li><a href="Project1.html">The Israeli Knesset by generations</a></li>
<li><a href="Project2.html">Israel Geo Data</a></li>
<li class="active"><a href="Project3.html">Scraper</a></li>
<li><a href="Project4.html">SQL</a></li>
<li><a href="Project5.html">R</a></li>
<li><a href="About.html">About</a></li>
<!-- Reference Page <li><a href="elements.html">Elements Reference</a></li>-->
</ul>
<ul class="icons">
<li><a target="_blank" rel="noopener noreferrer" href="https://www.linkedin.com/in/vadim-makeev-8891b1251/" class="icon-linkedin-sign icon-large"></span></a></li>
<li><a target="_blank" rel="noopener noreferrer" href="https://github.com/VadimM91?tab=repositories" class="icon-github icon-large"></i></span></a></li>
</ul>
</nav>
<!-- Main -->
<div id="main">
<!-- Post -->
<section class="post">
<header class="major">
<!-- <span class="date">April 25, 2017</span> -->
<h1>Local Scraper <br> Israeli Knesset<br />
</h1>
<p>A Python Decompiler from HTML-TXT to CSV an SQL Database</p>
</header>
<div class="image main"><img src="images/2.scraper_main.png" alt="" /></div>
<!-- <p>Paragraph 1 Example: here it goes - Donec eget ex magna. Interdum et malesuada fames ac ante ipsum primis in faucibus. Pellentesque venenatis dolor imperdiet dolor mattis sagittis. Praesent rutrum sem diam, vitae egestas enim auctor sit amet. Pellentesque leo mauris, consectetur id ipsum sit amet, fergiat. </p> -->
<h2> About This Image</h2>
<p>This is a snippet of my local Python HTML-TXT to CSV SQL Database Decompiler of the Official Israeli Knesset website.
The code extracts the years of birth of one Knesset cycle, combined with the rest of the code we can obtain a complete
database of the Knesset.
</p>
<h2>About This Project</h2>
<p>Aiming to streamline the process of creating a database of The Israeli Knesset Members, and after many failed attempts to
scrape the Israeli Knesset website online, I decided to build a local offline HTML scraper to obtain the data. </p>
<p>I developed a program that extracts a CSV file with the full names, date of birth, and the candidacy of the
Israeli Knesset members from the Official Israeli Knesset website by year of Knesset.</p>
<p>The CSV file is then imported into an SQL Database for further manipulations in Excel and R to produce
visualizations of different generations in each Israeli Knesset cycle. </p>
<p>Working on this project allowed me to hone my skills with Python and HTML on an official government website,
overcoming many challenges while making sure that the final program will be 100% accurate and dynamic.</p>
</p>
<p>The project is available on <a target="_blank" rel="noopener noreferrer" href="https://github.com/VadimM91/Knesset_Scraper">my GitHub</a>.
</p>
<p>For more works visit <a href="index.html">My Website</a>.
</p>
</section>
</div>
<!-- Footer -->
<footer id="footer">
<section class="split contact">
<section class="alt">
<h3>Location</h3>
<p>Tel Aviv, Israel</p>
</section>
<section>
<h3>Email</h3>
<p><a href=mailto:vadim.makeev.91@gmail.com>vadim.makeev.91@gmail.com</a></p>
</section>
<section>
<h3>Social</h3>
<ul class="icons alt">
<li><a target="_blank" rel="noopener noreferrer" href="https://www.linkedin.com/in/vadim-makeev-8891b1251/" class="icon-linkedin-sign icon-large"></span></a></li>
<li><a target="_blank" rel="noopener noreferrer" href="https://github.com/VadimM91?tab=repositories" class="icon-github icon-large"></i></span></a></li>
</ul>
</section>
</section>
</footer>
<header id="header">
<a href="index.html" class="logo">All my Projects</a>
</header>
<!-- Copyright
<div id="copyright">
<ul><li>© Untitled</li><li>Design: <a href="https://html5up.net">HTML5 UP</a></li></ul>
</div>-->
</div>
<!-- Scripts -->
<script src="assets/js/jquery.min.js"></script>
<script src="assets/js/jquery.scrollex.min.js"></script>
<script src="assets/js/jquery.scrolly.min.js"></script>
<script src="assets/js/browser.min.js"></script>
<script src="assets/js/breakpoints.min.js"></script>
<script src="assets/js/util.js"></script>
<script src="assets/js/main.js"></script>
</body>
</html>