Has anyone scraped phone numbers from Justdial? #1
Comments
I was able to earlier, but it seems they have started sending SVG images instead of numbers. |
@dvijparekh1995 However we can take the class name and map it from there. But this will break when they change it again. |
There is a series used by JD to show phone number. Series is as below |
Yes, I had that in mind. But the issue is they can change the class name whenever they want and this will break then. Better to think of something concrete. The most foolproof solution is to use digit recognition on the image. |
Yes, I think the same, as they will surely change it. |
I'm not getting the phone numbers. Can you tell me how to get them? |
@krishnamalireddy JD is now using SVGs in place of actual numbers, which is why parsing fails. There are a couple of ways to get around this. Each SVG has a unique code that can be mapped to a digit, but this will fail if they change the mapping again. Unfortunately, I don't have time to develop this right now. Will pick it up whenever I have some bandwidth. |
@Alankar0416 Could you please demonstrate how we can extract the numbers from the SVGs in code? |
A simple solution is to use .find_all instead of .string for the phone number. You will get the seemingly random codes of the SVGs; convert them to digits. |
The issue is that we can keep a map of SVG code to number, but JD can change it anytime. |
Yes, they can change it any time. If they change it, we have to decode it again. By the way, they haven't changed it for a long time. |
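For anyone following along, here is a minimal sketch of the class-name mapping idea discussed above. It is not the code from the repo. The class names (`icon-a`, `mobilesv`, and so on) are made-up placeholders, and the real ones have to be read off the live page by hand. The `.find_all` mentioned above looks like BeautifulSoup's, but this sketch uses the standard library's `html.parser` so it runs without extra packages. With BeautifulSoup, the collection step would be a `find_all("span")` over the phone number element.

```python
from html.parser import HTMLParser

# HYPOTHETICAL class-name-to-character map. These names are placeholders,
# not Justdial's real ones; rebuild the map by hand whenever the site changes it.
ICON_TO_CHAR = {
    "icon-a": "0", "icon-b": "1", "icon-c": "2", "icon-d": "3", "icon-e": "4",
    "icon-f": "5", "icon-g": "6", "icon-h": "7", "icon-i": "8", "icon-j": "9",
    "icon-plus": "+",
}


class IconCollector(HTMLParser):
    """Collects the class list of every <span>, in document order."""

    def __init__(self):
        super().__init__()
        self.class_lists = []

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            classes = dict(attrs).get("class") or ""
            self.class_lists.append(classes.split())


def decode_phone(html, mapping=ICON_TO_CHAR):
    """Return (number, unknown_icon_classes) for the icon spans in `html`."""
    collector = IconCollector()
    collector.feed(html)
    collector.close()
    chars, unknown = [], []
    for classes in collector.class_lists:
        hits = [mapping[c] for c in classes if c in mapping]
        if hits:
            chars.append(hits[0])
        else:
            # An icon class we cannot decode: the map is probably stale.
            unknown.extend(c for c in classes if c.startswith("icon-"))
    return "".join(chars), unknown


if __name__ == "__main__":
    sample = (
        '<p class="contact-info">'
        '<span class="mobilesv icon-j"></span>'
        '<span class="mobilesv icon-i"></span>'
        '<span class="mobilesv icon-zz"></span>'
        "</p>"
    )
    number, unknown = decode_phone(sample)
    print(number, unknown)
```

Returning the unknown classes alongside the number gives an early warning when the site changes its mapping, which is the failure mode everyone in this thread is worried about.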
Thanks @Alankar0416 for sharing the code. Here is an array mapping I've used as a second pass on the csv file. I used the .find_all for phone number.
Attached is my php code. |
Great work @ketanshah79 |
Yes, I did get it to work. Try running it on any CSV generated by your Python script. Thank you.
…On Fri, Feb 8, 2019, 4:01 PM Alankar Gupta ***@***.*** wrote:
Great work @ketanshah79 <https://github.com/ketanshah79>
Haven't tried this code. Are you able to successfully map phone numbers
with this additional script? If yes, I can add this into the original
script to make things easy for everyone.
|
It is retrieving only 10 records. |
@Alankar0416 could you please post the code along with @ketanshah79 's changes? Thanks! |
@mps1305 Check my forked repo. I have made the changes accordingly and it's working. Just change the URL to whichever one you want. |
Hey @dvijparekh, it was working up until some time back, then I started getting this error. Any help in this regard would be highly appreciated! |
It seems Justdial is blocking the scraper. Working on it. |
Hey, I have written a script that will scrape phone numbers from any Justdial business page. Please try this: PS: it also scrapes other info along with phone numbers. |
I am getting a urllib open timeout error. Is this code still working for anyone? |
Can anyone post the latest code here? |
The phone number is not correct
…On Thu, 6 Feb 2020 at 8:50 PM, Suhail Saifi ***@***.***> wrote:
can anyone update latest code here?
try this:
https://github.com/SuhailSaify/Justdial-Scrapper
|
I am about to solve this issue, can anyone help me with this error - https://stackoverflow.com/questions/60875316/typeerror-string-indices-must-be-integers-when-getting-class-fro-span-tag-using |
Please share the Justdial URL you are trying to scrape. |
Solved it brother. Thank you. |
There is another error though |
It means it is not able to find a span tag with class mrehover, so body.find is returning None, which doesn't have a text attribute or method.
|
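To illustrate the explanation above, here is a small sketch of guarding against a lookup that returns None before reading its text. The helper name `text_or_default` is made up for this example, and the `SimpleNamespace` objects are only stand-ins for whatever `body.find(...)` returns, so the snippet runs without a parsing library. If the library in use is BeautifulSoup, note that `.text` is a property and `get_text()` is the method, so calling `text()` would fail even when the tag is found.

```python
from types import SimpleNamespace


def text_or_default(node, default=""):
    """Return node.text stripped, or `default` when the lookup returned None."""
    if node is None:
        return default
    return node.text.strip()


# Stand-ins for what a lookup such as body.find(...) might return:
found = SimpleNamespace(text="  Some Business Name ")  # a matching tag
missing = None                                          # no matching tag

print(text_or_default(found))
print(text_or_default(missing, default="N/A"))
```

Using a default instead of letting the scraper crash keeps one missing field from aborting the whole run, though it is still worth logging such rows, since a missing span can also mean the page layout has changed.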
Hey, use this method https://youtu.be/EkbF5JwuHqU |