Make doc2vec imdb ipynb tutorial run in python 2 and 3 #1220

robotcator · 2017-03-18T00:11:54Z

fix the compatibility between python2 and python3 for the notebook of doc2vec-IMDB.ipynb #1139.

tmylk · 2017-03-19T01:02:50Z

Please merge in develop into your branh to resolve the conflicts git fetch;git merge develop

robotcator · 2017-03-19T08:44:45Z

It seems that I select the wrong base branch. I have change to the develop branch and the conflicts were resolved. And the commit( 1aa3f33) was my operation mistake.

piskvorky · 2017-04-09T07:27:02Z

docs/notebooks/doc2vec-IMDB.ipynb

@@ -92,8 +116,7 @@
    "        txt_files = glob.glob('/'.join([dirname, fol, '*.txt']))\n",
    "\n",
    "        for txt in txt_files:\n",
-    "            with open(txt, 'r', encoding='utf-8') as t:\n",
-    "                control_chars = [chr(0x85)]\n",
+    "            with codecs.open(txt, 'r', encoding='utf-8') as t:\n",


Use smart_open instead: drop codecs, open files in binary mode and convert content to unicode explicitly.

Ok, I will drop the codecs and move to smart_open.

piskvorky · 2017-04-09T07:27:18Z

docs/notebooks/doc2vec-IMDB.ipynb

@@ -104,21 +127,28 @@
    "            temp += \"\\n\"\n",
    "\n",
    "        temp_norm = normalize_text(temp)\n",
-    "        with open('/'.join([dirname, output]), 'w', encoding='utf-8') as n:\n",
+    "        with codecs.open('/'.join([dirname, output]), 'w', encoding='utf-8') as n:\n",


Not portable -- please use os.path.join.

piskvorky · 2017-04-09T07:27:49Z

docs/notebooks/doc2vec-IMDB.ipynb

    "            n.write(temp_norm)\n",
    "\n",
    "        alldata += temp_norm\n",
    "\n",
-    "    with open('/'.join([dirname, 'alldata-id.txt']), 'w', encoding='utf-8') as f:\n",
+    "    with codecs.open('/'.join([dirname, 'alldata-id.txt']), 'w', encoding='utf-8') as f:\n",


Drop codecs, use binary mode.

robotcator added 2 commits March 17, 2017 22:53

fix the compatibility between python2 & 3

1aa3f33

fix the compatibility between python2 & 3

bf48e8d

robotcator mentioned this pull request Mar 18, 2017

[WIP][DNM] error-resistant train(). Fix #1052 #1139

Closed

1 task

robotcator changed the base branch from doc_fix to develop March 19, 2017 08:33

tmylk changed the title ~~Fix notebook~~ Make doc2vec imdb ipynb tutorial run in python 2 and 3 Mar 20, 2017

tmylk merged commit 854fad6 into piskvorky:develop Mar 20, 2017

piskvorky reviewed Apr 9, 2017

View reviewed changes

robotcator deleted the fix-notebook branch June 1, 2017 01:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make doc2vec imdb ipynb tutorial run in python 2 and 3 #1220

Make doc2vec imdb ipynb tutorial run in python 2 and 3 #1220

robotcator commented Mar 18, 2017

tmylk commented Mar 19, 2017

robotcator commented Mar 19, 2017

piskvorky Apr 9, 2017

robotcator Apr 11, 2017

piskvorky Apr 9, 2017 •

edited

Loading

piskvorky Apr 9, 2017

Make doc2vec imdb ipynb tutorial run in python 2 and 3 #1220

Make doc2vec imdb ipynb tutorial run in python 2 and 3 #1220

Conversation

robotcator commented Mar 18, 2017

tmylk commented Mar 19, 2017

robotcator commented Mar 19, 2017

piskvorky Apr 9, 2017

Choose a reason for hiding this comment

robotcator Apr 11, 2017

Choose a reason for hiding this comment

piskvorky Apr 9, 2017 • edited Loading

Choose a reason for hiding this comment

piskvorky Apr 9, 2017

Choose a reason for hiding this comment

piskvorky Apr 9, 2017 •

edited

Loading