[Bug 55195] Invalid Title in flickrripper

2014-09-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

John Mark Vandenberg  changed:

   What|Removed |Added

 CC||jay...@gmail.com
  Component|General |Other scripts

-- 
You are receiving this mail because:
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 55195] Invalid Title in flickrripper

2013-12-31 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

xqt  changed:

   What|Removed |Added

 Status|ASSIGNED|NEW
 CC||i...@gno.de



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #8 from Kunal Mehta (Legoktm)  ---
The issue is still not fixed; actually, it's worse:
C:\Dev\SVN\pywikipedia>flickrripper.py -autonomous -user_id:40561337@N07
-addcategory:"Files from Abhisit Vejjajiva Flickr stream"
5703017392
Traceback (most recent call last):
  File "C:\Dev\SVN\pywikipedia\flickrripper.py", line 609, in <module>
    main()
  File "C:\Dev\SVN\pywikipedia\flickrripper.py", line 599, in main
    removeCategories, autonomous)
  File "C:\Dev\SVN\pywikipedia\flickrripper.py", line 257, in processPhoto
    filename = getFilename(photoInfo)
  File "C:\Dev\SVN\pywikipedia\flickrripper.py", line 172, in getFilename
    % (title, project, username)).exists():
  File "C:\Dev\SVN\pywikipedia\wikipedia.py", line 1284, in exists
    self.get()
  File "C:\Dev\SVN\pywikipedia\wikipedia.py", line 705, in get
    expandtemplates = expandtemplates)
  File "C:\Dev\SVN\pywikipedia\wikipedia.py", line 787, in _getEditPage
    raise BadTitle('BadTitle: %s' % self)
pywikibot.exceptions.BadTitle: BadTitle:
[[commons:File:&#3609;&#3634;&#3618;&#3
585;&#3619;&#3633;&#3600;&#3617;&#3609;&#3605;&#3619;&#3637;
&#3649;&#3621;&#363
2;&#3588;&#3603;&#3632;&#3648;&#3604;&#3636;&#3609;&#3607;&#3634;&#3591;&#3629;&
#3629;&#3585;&#3592;&#3634;&#3585;&#3585;&#3619;&#3640;&#3591;&#3592;&#3634;&#35
85;&#3634;&#3619;&#3660;&#3605;&#3634;
&#3626;&#3634;&#3608;&#3634;&#3619;&#3603
;&#3619;&#3633;&#3600;&#3629;&#3636;&#3609;&#3650;&#3604;&#3609;&#3637;&#3648;&#
3595;&#3637;&#3618;&#3585;&#3621;&#3633;&#3610;&#3618;&#3633;&#3591;&#3611;&#361
9;&#3632;&#3648;&#3607;&#3624;&#3652;&#3607;&#3618;
&#3623;&#3633;&#3609;&#3629;
&#3634;&#3607;&#3636;&#3605;&#3618;&#3660;&#3607;&#3637;&#3656;
8 &#3614;&#3620;
&#3625;&#3616;&#3634;&#3588;&#3617;
&#3614;.&#3624;.2554 (Photographer attached
to the Prime Minister of the Kingdom of Thailand (H.E.Mr.Abhisit Vejjajiva) ,
Peerapat Wimolrungkarat -
&#3614;&#3637;&#3619;&#3614;&#3633;&#3602;&#3609;&#3660;
&#3623;&#3636;&#3617;&#3621;&#3619;&#3633;&#3591;&#3588;&#3619;&#3633;&#3605;&#
3609;&#3660;) @is50mm - Flickr - Abhisit Vejjajiva.jpg]]



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #12 from Kunal Mehta (Legoktm)  ---
I do not see a conversion caused by the exception. I converted the title from
HTML entities to Unicode in my last commit.
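
For readers following along, here is a minimal sketch of what decoding
numeric HTML entities into Unicode text looks like. It uses Python 3's
standard html.unescape, which is an assumption made for illustration; the
commit itself targets the Python 2 code base and may use a different helper.
The sample string is made up.

```python
import html

# Hypothetical sample: three Thai characters written as numeric entities.
raw = "&#3652;&#3607;&#3618; - Flickr"

# html.unescape turns each &#NNNN; reference into the matching character.
decoded = html.unescape(raw)

print(decoded)
```

The decoded string holds the Thai characters themselves, so later length
checks operate on real text rather than on entity markup.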



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #13 from Kunal Mehta (Legoktm)  ---
Line 787 doesn't return the title; it returns the whole page (self). When you
print the object rather than the title, it gets converted there. I used a log
to confirm that the title was UTF-8 before filing this bug.
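
One way such &#NNNN; sequences can appear when an object is printed is
Python's "xmlcharrefreplace" error handler, which replaces every character
that the target encoding cannot represent with a numeric reference. The
sketch below only demonstrates that mechanism; that wikipedia.py does exactly
this when formatting the page is an assumption, and the sample title is
made up.

```python
# Hypothetical title containing three Thai characters.
title = "\u0e44\u0e17\u0e22.jpg"

# Encoding to ASCII with xmlcharrefreplace keeps ASCII characters and
# rewrites everything else as a decimal character reference.
printable = title.encode("ascii", "xmlcharrefreplace").decode("ascii")

print(printable)
```

The underlying title is unchanged by this; only the printed form carries
the entities.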



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #11 from Kunal Mehta (Legoktm)  ---
Those are the Thai parts of the page title, which are being converted when the
exception is thrown.



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #6 from Kunal Mehta (Legoktm)  ---
- **status**: open --> pending



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #14 from Kunal Mehta (Legoktm)  ---
Thanks for testing. The length calculation was wrong. I've corrected it.



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #10 from Kunal Mehta (Legoktm)  ---
Where do the HTML entities come from? Are they part of the Flickr page?



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #9 from Kunal Mehta (Legoktm)  ---
- **status**: pending-fixed --> open



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #7 from Kunal Mehta (Legoktm)  ---
- **status**: pending --> pending-fixed



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

Kunal Mehta (Legoktm)  changed:

   What|Removed |Added

   See Also||https://sourceforge.net/p/pywikipediabot/bugs/1466



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #2 from Kunal Mehta (Legoktm)  ---
An idea for getFilename (could anybody test whether it works?):

if not title:
    # find the max length for a mw title
    maxBytes = 240 - len(project.encode('utf-8')) \
                   - len(username.encode('utf-8'))
    description = photoInfo.find('photo').find('description').text
    if description:
        descBytes = len(description.encode('utf-8'))
        if descBytes > maxBytes:
            # maybe we cut more than needed, anyway we do it
            items = max(0, len(description) - maxBytes + descBytes)
            description = description[:items]
        title = cleanUpTitle(description)
    else:
        title = u''
        # Should probably have the id of the photo as last resort.
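
As a point of comparison, a description can also be cut to a byte budget
directly, without estimating how many characters to drop. This is a sketch
in Python 3 syntax, not the code that was committed; truncate_utf8 is a
hypothetical helper name, and the budget of 240 bytes is taken from the
snippet above.

```python
def truncate_utf8(text, max_bytes):
    """Return the longest prefix of text whose UTF-8 form fits in max_bytes."""
    max_bytes = max(0, max_bytes)  # guard against a negative budget
    encoded = text.encode("utf-8")
    if len(encoded) <= max_bytes:
        return text
    # Slice at the byte limit; errors="ignore" drops a trailing character
    # that the slice cut in half, so the result is always valid text.
    return encoded[:max_bytes].decode("utf-8", errors="ignore")


# Hypothetical description: 300 Thai characters, 3 bytes each in UTF-8.
description = "\u0e44\u0e17\u0e22" * 100
shortened = truncate_utf8(description, 240)
print(len(shortened), len(shortened.encode("utf-8")))
```

Because the cut is made on the encoded bytes, the result never exceeds the
budget regardless of how many bytes each character needs.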



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #3 from Kunal Mehta (Legoktm)  ---
- **assigned_to**: nobody --> xqt



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #5 from Kunal Mehta (Legoktm)  ---
- **summary**: Invalid Title --> Invalid Title in flickrripper



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #1 from Kunal Mehta (Legoktm)  ---
I guess the title is cut by MediaWiki and not by the slice operator, since
slicing works correctly for unicode strings. len() also gives the number of
characters, not the number of bytes. Do we have any size(object) method?
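
To make the difference concrete, here is a small sketch in Python 3 syntax
(the script itself is Python 2, and the sample string is made up). Encoding
to UTF-8 first and then taking len() gives the byte count.

```python
# Hypothetical sample: three Thai characters.
title = "\u0e44\u0e17\u0e22"

char_count = len(title)                  # number of characters
byte_count = len(title.encode("utf-8"))  # number of UTF-8 bytes

print(char_count, byte_count)
```

Each Thai character takes three bytes in UTF-8, so a title can exceed a byte
limit well before its character count looks long.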



[Bug 55195] Invalid Title in flickrripper

2013-10-04 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=55195

--- Comment #4 from Kunal Mehta (Legoktm)  ---
Fix committed in r10387; please check.
