In [1]:
!date # written on, coded on the day before
Sun Sep  8 07:00:59 CDT 2019
In [1]:
import feedparser
In [2]:
feed = feedparser.parse("")
In [3]:
 'namespaces': {'': ''}}
In [4]:
feed_title = feed['feed']['title']
In [5]:
"It's A Digital Disease!"

Great place to start.


What the heck is happening up there? Does anyone read .rss? I admittedly don't. I know I see the buttons at the bottom of articles sometimes, but personally, most of the social sharing I take part in is stuff I see on Twitter, plus whatever social interaction goes into deciding what shows up in my feed on YouTube.


I kind of figured out one goal that I want to work towards and I know what that's going to be.

I want to do some friggin parsing.

What's parsing?

(I'm secretly inserting techy vocab words into your consciousness)

It's taking that mumbo jumbo up there and making sense of it. I wouldn't go so far as to call it translation. That mumbo jumbo is a data signal, and parsing is the technique/tool you build to understand it.

But to answer the ORIGINAL question: parsing is what you do to understand it. it's THE VERB. ya know?
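To make the verb concrete, here's a minimal sketch. The feed text is made up (it is not my real feed), and I'm swapping in Python's built-in xml.etree.ElementTree for feedparser just so it runs with nothing installed. The function name parse_titles is also just something I picked for illustration.

```python
import xml.etree.ElementTree as ET

# Made-up RSS snippet standing in for a real feed (hypothetical data).
raw_feed = """<rss version="2.0">
  <channel>
    <title>Example Blog</title>
    <item><title>Day 8</title></item>
    <item><title>Day 9</title></item>
  </channel>
</rss>"""


def parse_titles(xml_text):
    """Turn raw RSS text into a dict with the feed title and item titles."""
    root = ET.fromstring(xml_text)      # root is the <rss> element
    channel = root.find("channel")      # the single <channel> child
    return {
        "feed_title": channel.findtext("title"),
        "item_titles": [item.findtext("title") for item in channel.findall("item")],
    }


parsed = parse_titles(raw_feed)
print(parsed["feed_title"])
print(parsed["item_titles"])
```

Raw text goes in, a plain dict you can actually work with comes out. That's the whole idea of parsing.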

warning. this post will probably get long.

I'm going to construct my owwwwn parser. And try to learn how to do regex at the same time.


omg what is regex. another effing vocab word. it's like the holy grail of all parsers but you seriously can't build a parser without it right? Like. omg.

Basically regex (short for "regular expression") is a way to describe a pattern and then find, or pull out, whatever matches that pattern in any kind of text.
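Here's a small sketch of what that looks like with Python's built-in re module. The sample sentence and the date pattern are things I made up for illustration; they aren't from the feed above.

```python
import re

# Made-up text to search (hypothetical example data).
text = "Posted on 2019-09-08, edited on 2019-09-09."

# \d{4} means four digits, \d{2} means two digits.
# Each pair of parentheses captures one piece of the match.
date_pattern = re.compile(r"(\d{4})-(\d{2})-(\d{2})")

# search() finds the first match anywhere in the text.
first = date_pattern.search(text)
print(first.group(0))   # the whole matched string
print(first.groups())   # the captured pieces as a tuple

# findall() returns every match as a tuple of the captured groups.
all_dates = date_pattern.findall(text)
print(all_dates)
```

The pattern is the "question" you ask the text, and the match objects are the answers.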

Also here are a bunch of other parsers. Looks like a common project:
(I'm going to start sharing links that you can copy and paste so that you don't get into the habit of just CLICKING ON LINKS. Don't just click on stuff. That's yucky. Also know that I'm not judging you if you do do that, but also I literally just said "do do". You decide.)

No more tell. Here it is:

In [6]:
import re # yes Rihanna. See - all rap/hip hop IS programming.
from bs4 import BeautifulSoup # yeah that's hilarious. import bs.
In [7]:
# mid post thought: I kind of want to write short halloween scary stories. 
# I might hide them.. 
# I might hide them..
In [8]:
end_style_tag = re.compile(r'</style>')
In [9]:
with open("Untitled1.html") as html_file:
    # name the parser explicitly so BeautifulSoup doesn't have to guess (and warn)
    soup = BeautifulSoup(html_file, "html.parser")

I might literally write this program line by line so keep that in mind as you're reading this. I'll try to step through my thinking, but right now what I want to do is find where the HTML content begins in a Jupyter notebook file (the environment that I use to program, and you should too if you program in Python) that has been converted into an HTML file.
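One possible way to go about it, as a rough sketch: use the compiled `</style>` pattern to find where the last style block ends and keep everything after it. The HTML string is a tiny made-up stand-in for a converted notebook, and text_after_last_style is a hypothetical helper name, not something from a library.

```python
import re

end_style_tag = re.compile(r'</style>')

# Tiny made-up stand-in for a converted notebook (hypothetical content).
html_text = (
    "<html><head><style>body { margin: 0; }</style>"
    "<style>.cell { padding: 4px; }</style></head>"
    "<body><p>Great place to start.</p></body></html>"
)


def text_after_last_style(html):
    """Return everything after the final </style>, or the input unchanged if there is none."""
    last_end = None
    for match in end_style_tag.finditer(html):
        last_end = match.end()   # index just past this closing tag
    return html if last_end is None else html[last_end:]


rest = text_after_last_style(html_text)
print(rest)
```

I went with the last `</style>` because a converted notebook can carry more than one style block. If yours only has one, a single search() call would do the same job.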

it's a body of text. that's it.

I had this written yesterday. Here's the problem with same day posts in programming. Most of what I am doing on a daily basis is exploration, if I really want to get some real learning done. Most of what I do doesn't yield finished results the way most developers' work does, and I'm kind of okay with that. Pulling data is more important to me.

What I was actually trying to find out here was this:

when I am done writing these blog posts I go back into my terminal/command line/command prompt, whatever you know it as, I type jupyter nbconvert this_note_book_file.ipynb and then I move the .ipynb file that is listed in the .gitignore file:~:

(heh I'm not sure you can see that on mobile.

anyway. I'll test it out.)

just so everyone is aware of this - there are certain features I have implemented on the desktop version of this site that I can't reproduce on mobile, and the reverse is true as well, which totally ruins the balance of experiencing my website.

:~: (see the tethers?) then there is a .html version of this file that is named the title of this page which I put into the blog/ folder which is why the url to this page is

Alrighty. Day 10 post to come. More on Scrapy and why it's lit.

In [10]:
<p>Great place to start.</p>