Doesn't seem to work on .aspx pages #24

tanayz · 2015-08-14T04:46:05Z

Rodric,

Great work, and thanks for sharing. I'm mainly trying to extract the main text from news links, and it seems to work on most sites except for .aspx pages. On .aspx pages it returns only meta-information such as the copyright notice.

For example:

import eatiht

url='http://www.fool.com/investing/general/2015/08/12/this-startup-is-bigger-than-microsoft-corporation.aspx'

eatiht.extract(url)
Out[37]: u'\n Copyright, Trademark and Patent Information Terms of Use Please read our Terms and Conditions\n \xa9 1995 - 2015 The Motley Fool. All rights reserved. \n\n\n BATS data provided in real-time. NYSE, NASDAQ and NYSEMKT data delayed 15 minutes. Real-Time prices provided by BATS BZX. Market data provided by Interactive Data. Company fundamental data provided by Morningstar. Earnings Estimates, Analyst Ratings and Key Statistics provided by Zacks. SEC Filings and Insider Transactions provided by Edgar Online. Powered and implemented by Interactive Data Managed Solutions.

rodricios · 2015-08-15T03:15:57Z

Hi @tanayz, I will look into this issue. Thanks for bringing it up 😄
