Fixes pagination detection in Tumblr likes.
Fixes the pagination detection in the Tumblr likes crawler for blogs whose names contain a dash. The character class that matches the href path of the next-page link did not include the dash, so the link was not matched for those blogs.
johanneszab committed Oct 11, 2018
1 parent d920f6e commit 6e823a0
Showing 1 changed file with 1 addition and 1 deletion.
@@ -215,7 +215,7 @@ private long ExtractNextPageLink(string document)
// <a id="next_page_link" href="/liked/by/wallpaperfx/page/5/1457139681" class="next button chrome blue">Next</a></div></div>

long unixTime = 0;
- var pagination = "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.]+/([0-9]+)/([A-Za-z0-9]+))\"";
+ var pagination = "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.-]+/([0-9]+)/([A-Za-z0-9]+))\"";
long.TryParse(Regex.Match(document, pagination).Groups[3].Value, out unixTime);
return unixTime;
}
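
The effect of the one-character change can be checked in isolation. The following is a minimal sketch, not part of the commit: the class PaginationDemo, the helper ExtractTimestamp, and the blog name wallpaper-fx are hypothetical and exist only for illustration. The two patterns and the parsing logic are copied from the diff above.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical demo class; not part of the repository.
public static class PaginationDemo
{
    // Pattern before the fix: the path character class has no '-'.
    public const string OldPattern =
        "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.]+/([0-9]+)/([A-Za-z0-9]+))\"";

    // Pattern after the fix: '-' is placed last in the class, where it is a literal.
    public const string NewPattern =
        "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.-]+/([0-9]+)/([A-Za-z0-9]+))\"";

    // Mirrors the body of ExtractNextPageLink, with the pattern passed in.
    public static long ExtractTimestamp(string document, string pattern)
    {
        long unixTime = 0;
        // Groups[3] is the trailing segment of the href; it is empty when nothing matches,
        // in which case TryParse fails and unixTime stays 0.
        long.TryParse(Regex.Match(document, pattern).Groups[3].Value, out unixTime);
        return unixTime;
    }

    public static void Main()
    {
        // Sample taken from the comment in the source file.
        string plain = "<a id=\"next_page_link\" href=\"/liked/by/wallpaperfx/page/5/1457139681\" class=\"next button chrome blue\">Next</a>";
        // Same link with an assumed blog name that contains a dash.
        string dashed = "<a id=\"next_page_link\" href=\"/liked/by/wallpaper-fx/page/5/1457139681\" class=\"next button chrome blue\">Next</a>";

        Console.WriteLine(ExtractTimestamp(plain, OldPattern));   // 1457139681
        Console.WriteLine(ExtractTimestamp(dashed, OldPattern));  // 0
        Console.WriteLine(ExtractTimestamp(dashed, NewPattern));  // 1457139681
    }
}
```

With the old pattern, the dashed link yields 0, which is the same value the method returns when there is no next page. That is presumably why the crawler stopped paginating on such blogs.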