Channel: feed2exec:c3b4db038edda84f2331492e3424e2cfd2854d70 commits
Browsing latest articles
Browse All 40 View Live

ignore more coverage files

prepare 0.13.0 release

fix commented out debugging statement, no change

warn instead of crashing on invalid dates

We used to completely crash when a feed had invalid or missing dates. After reviewing the standards, it turns out this is not quite valid behavior: RSS 0.90 and 0.91, for example, do not have dates at...
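
A minimal sketch of the "warn instead of crash" behavior described above. The function name `parse_date_or_warn` and the logger name are hypothetical, and the standard library's RFC 822 parser stands in for whatever parser feed2exec actually uses:

```python
import logging
from email.utils import parsedate_to_datetime

# hypothetical logger name, chosen for this illustration only
logger = logging.getLogger("feed_dates")


def parse_date_or_warn(text):
    """Return a datetime for an RFC 822 date string, or None with a warning.

    Missing or malformed dates are logged instead of raising, so a single
    bad item cannot abort the processing of the whole feed.
    """
    if not text:
        logger.warning("item has no date")
        return None
    try:
        return parsedate_to_datetime(text)
    except (TypeError, ValueError):
        # older Python versions raise TypeError, newer ones ValueError
        logger.warning("item has an invalid date: %r", text)
        return None
```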

expand and clean up date validation routines

We used to have a long list of fields. Now we just iterate over the item, then the feed, and look for the fields we want. It's cleaner visually and might even make some feeds validate, as we now look...
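
The lookup order described above (item first, then feed) could be sketched roughly as follows. The field names in `DATE_FIELDS` and the function `find_date` are assumptions for illustration, not the actual feed2exec code:

```python
# Assumed field names; the real list of wanted fields may differ.
DATE_FIELDS = ("updated_parsed", "published_parsed")


def find_date(item, feed):
    """Return the first usable date found in the item, then in the feed."""
    for source in (item, feed):
        for field in DATE_FIELDS:
            value = source.get(field)
            if value:
                return value
    # nothing found anywhere: let the caller decide how to handle it
    return None
```

Because the feed is only consulted after the item, an item-level date always wins over a feed-level one.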

use dateparser module if available

This behaves better than the feedparser date parser in most scenarios. Still needs unit tests and dependency checks. Closes: #6
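
One common way to use a module "if available" is a guarded import with a fallback. The sketch below assumes a hypothetical `parse_date` helper and falls back to the standard library's RFC 822 parser rather than to feedparser, purely to stay self-contained:

```python
from email.utils import parsedate_to_datetime

try:
    import dateparser  # optional dependency
except ImportError:
    dateparser = None


def parse_date(text):
    """Parse a date string, preferring dateparser when it is installed."""
    if dateparser is not None:
        # dateparser.parse returns a datetime, or None if it cannot parse
        return dateparser.parse(text)
    try:
        return parsedate_to_datetime(text)
    except (TypeError, ValueError):
        return None
```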

handle broken pipe correctly from plugins

Before this, doing "feed2exec parse foo | head" would yield an error message for *every* feed item. This silences the warnings completely.
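
A rough illustration of the idea, not the actual plugin code: catch `BrokenPipeError` once around the output loop and stop quietly, instead of reporting an error per item. The function name `emit` is hypothetical:

```python
import sys


def emit(lines, stream=None):
    """Write lines to a stream, stopping silently if the reader goes away.

    Returns True if everything was written, False on a broken pipe.
    """
    if stream is None:
        stream = sys.stdout
    try:
        for line in lines:
            stream.write(line + "\n")
        stream.flush()
    except BrokenPipeError:
        # the consumer (for example `head`) closed its end of the pipe
        return False
    return True
```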

add JSON output plugin

This plugin is very simple, if not trivial: it simply dumps all the feed items in a JSON stream. This can be parsed by `jq` on the commandline to diagnose feed problems, do scripting or whatever. This...
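
A plugin of this kind might look roughly like the sketch below: one JSON document per line, which `jq` can consume as a stream. The `output` signature is an assumption and not necessarily the plugin interface feed2exec uses:

```python
import json
import sys


def output(item, stream=None):
    """Dump one feed item as a single line of JSON."""
    if stream is None:
        stream = sys.stdout
    # default=str makes non-JSON types (dates, for example) printable
    json.dump(item, stream, sort_keys=True, default=str)
    stream.write("\n")
```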

recommend the dateparser module for better date handling

bump standards, no change

prepare 0.14.0 release

switch to a single channel for all my projects

play catch-up with newer html2text version again

workaround timezone parsing bug in dateparser

switch to dateparser for PyPI and tests as well

Because the Debian package recommends dateparser, I had different results running tests natively on Debian and within tox. This harmonizes things and makes use of dateparser everywhere, warts and...

properly register the special mark we are using

reduce noise level of 'missing time' problems

It seems like previous versions of feedparser would never trigger that problem, and would fill in the date instead. Now I have feeds that have this on *every* item and it generates a lot of noise in...

prepare 0.15.0 release

split large feeds.py into model.py and controller.py

This rearchitecture seems to make sense to me. I would like to keep those files smaller and that naming will force me to follow that model/controller distinction more clearly. Already, the parse/fetch...

rename FeedCacheStorage to FeedItemCacheStorage

This is, effectively, a per-item cache, not a full feed cache. We want to implement the latter as part of #10 so it makes sense to rename this first. This is an API breaking change.

factor out getter/setters in the base sqlite class

This should pave the way for reusing this class in a caching backend like cachecontrol.

use context manager for connections, inspired by cachecontrol_sqlite

remove conn member in sqlite to force use of context manager

split connection cache logic into a separate function, to simplify

implement thread-level locking

This is not absolutely necessary as we don't do thread-level parallelism. But if we ever want to switch back to doing that, this is an elegant way of supporting that. Inspired by cachecontrol-sqlite.

enforce commit in context manager unless explicitly disabled

This makes sure we never, ever forget to commit unless we *explicitly* disable it. This is also inspired by cachecontrol-sqlite, except the latter uses False as a default for the autocommit, which...
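
The combination of a connection context manager, a lock, and commit-by-default could be sketched as follows. The class name `SqliteStorage`, the method `connection` and the `commit` parameter are hypothetical and only illustrate the pattern:

```python
import sqlite3
import threading
from contextlib import contextmanager


class SqliteStorage:
    """Hypothetical storage base class, for illustration only."""

    # one lock shared by all instances, serializing database access
    # (not reentrant: do not nest connection() calls in one thread)
    _lock = threading.Lock()

    def __init__(self, path):
        self.path = path

    @contextmanager
    def connection(self, commit=True):
        """Yield a connection; commit on success unless commit=False."""
        with self._lock:
            conn = sqlite3.connect(self.path)
            try:
                yield conn
                if commit:
                    conn.commit()
            finally:
                # closing without a commit discards pending changes
                conn.close()
```

Since there is no long-lived connection attribute to reach for, callers have to go through the context manager, and the commit can only be skipped by passing `commit=False` on purpose.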

small simplification of session init logic

first attempt at using cachecontrol, failing

It seems we need to provide the timestamp, and it doesn't store it in the database, so it doesn't send if-modified-since headers, so it fails. Maybe we are better off implementing this on our own?

fix broken cache adapter support

We did not need to pass the if-modified-since header. All that was needed was that we look up (and return!) the cache value properly. So also remove that from the database. The way things were set up,...

forcibly preset the builtin feed session

Without this setting, the wrong session gets initialized in the new Feed object. Before the caching layer was implemented, this didn't matter much because those sessions were never called. But since...

install python3-dev, required for compiling regex

Not sure why all that junk is necessary, but I want to fix the build.

avoid newer feedparser versions

feedparser 6.0+ removed the FeedParserDict which we depend on: https://github.com/kurtmckee/feedparser/issues/197 Until we refactor the Feed class, stick with older versions of feedparser.

also test on python 3.8, now available in python:latest

run apt update before install

Merge remote-tracking branch 'origin/cachecontrol'

update supported Python release numbers

sync up tox.ini and setup.py version lists

add missing cachecontrol dep

move session and fetching to the feed manager

Having the session and the network code in the "model" makes no sense: that stuff belongs in the "controller". Having it there made it particularly difficult to implement the caching layer, as I had...

remove class-level sticky session parameter

This cleans up a lot of stuff. Now we can treat the session as a normal feed_manager parameter. Since there is usually only one feed_manager in operation at any time, it is basically a static member....
