This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
start [2017/10/27 08:25] zoza [Scraping and mining Dezeen articles] |
start [2018/04/26 13:46] (current) zoza [Python and SOM] |
||
---|---|---|---|
Line 1: | Line 1: | ||
====== POSTDOCTORAL RESERACH ====== | ====== POSTDOCTORAL RESERACH ====== | ||
+ | |||
+ | ===== Python and SOM ===== | ||
+ | |||
+ | - python module by Vahid Moosavi of CAAD, **sompy** | ||
+ | |||
+ | - another SOM python implementation, **somoclu**: https://somoclu.readthedocs.io/en/stable/index.html | ||
+ | |||
+ | - SOM Java Toolbox created at TU Wien http://www.ifs.tuwien.ac.at/dm/somtoolbox/ | ||
+ | |||
+ | - Twitter sentiment analysis with Python: https://towardsdatascience.com/another-twitter-sentiment-analysis-with-python-part-5-50b4e87d9bdd | ||
===== Scraping and mining twitter streams ===== | ===== Scraping and mining twitter streams ===== | ||
Line 32: | Line 42: | ||
It is also important to declare fields in pages that will be scraped. This is done in the dezeen/items.py file, using eg (the Class is already declared when you start project). | It is also important to declare fields in pages that will be scraped. This is done in the dezeen/items.py file, using eg (the Class is already declared when you start project). | ||
- | <code>Class DezeenItem(Item): | + | <code python>Class DezeenItem(Item): |
title = Field() | title = Field() | ||
link = Field() | link = Field() | ||
Line 38: | Line 48: | ||
</code> | </code> | ||
- | These fields will be later used as part of the item dictionary (e.g. item['link'] | + | These fields will be later used as part of the item dictionary (e.g. item['link']) |
====== DOCTORAL RESEARCH ====== | ====== DOCTORAL RESEARCH ====== |