-
Notifications
You must be signed in to change notification settings - Fork 15
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
'charmap' codec can't encode character '\u03bc' in position 10: character maps to <undefined> #58
Comments
This problem is not caused by #57, I will check this tomorrow. |
close: mediawiki-client-tools#58 not tested
Replace it whith This problem may only occur on windows, I can't test it. |
It's still broken, sorry. |
Also the log file looks a little different to stdout. Maybe revert the changes since PR #56 ? |
Can you provide the .log file? |
In the terminal:
In the file:
|
This is expected behavior, not a bug. Because currently tee() does nothing more than simply redirect the stdout to a file by using |
line 42 changed to |
It works without a problem on Kubuntu. It also writes a file Special:Version.html Windows chokes on the colon in the filename. This isn't a new problem, I've only just noticed it. |
|
|
Yes, it makes an empty file named Special The following characters are not allowed in NTFS file names:
|
NTFS: Most linux fs: Illegal characters in page titles (MediaWiki): Illegal characters in file names (MediaWiki): Remove the
So, maybe it's not a good idea to use wikiteam to dump images on NTFS ... |
Maybe we can rename |
If I extract |
There's some other problem, without trying to get
|
I forgot to mention that this is caused by another bug in --xmlrevision, and currently wikiteam3 (which also exists in upstream) can't handle the revision being hidden/deleted perfectly. There is a simple patch that skips the whole page crawl when carwler see a hidden revision, but I didn't include the patch in the 0.5.0-alpha PR because it's not a perfect solution, and I'm working on fixing it. |
Try this, no hidden reversion on the site: |
|
I did some tests, |
XML dump seems okay
However one file didn't download. There's an errors.log file with:
|
File naming involves many issues.
Also, I found that wikiteam has problems with truncating long filenames. Again, I will discus and fix these problems later. |
I tried it out with a wiki with hidden revisions and can confirm |
It now gets past where it previously failed on NTFS (over "Fairchild μA726"). |
#64 to change Special:Version.html to Special-Version.html |
@yzqzss After PR #57 the XML dump fails
snipped
snipped
The text was updated successfully, but these errors were encountered: