标签归档:python

python去/替标点总结

首先是英文标点:
import re, string, timeit

s = “string. With. Punctuation”
exclude = set(string.punctuation)
table = string.maketrans(“”,””)
regex = re.compile(‘[%s]’ % re.escape(string.punctuation))

def test_set(s):
return ”.join(ch for ch in s if ch not in exclude)

def test_re(s): # From Vinko’s solution, with fix.
return regex.sub(”, s)

def test_trans(s):
return s.translate(table, string.punctuation)

def test_repl(s): # From S.Lott’s solution
for c in string.punctuation:
s=s.replace(c,””)
return s

print “sets :”,timeit.Timer(‘f(s)’, ‘from __main__ import s,test_set as f’).timeit(1000000)
print “regex :”,timeit.Timer(‘f(s)’, ‘from __main__ import s,test_re as f’).timeit(1000000)
print “translate :”,timeit.Timer(‘f(s)’, ‘from __main__ import s,test_trans as f’).timeit(1000000)
print “replace :”,timeit.Timer(‘f(s)’, ‘from __main__ import s,test_repl as f’).timeit(1000000)
速度对比:
sets : 19.8566138744
regex : 6.86155414581
translate : 2.12455511093
replace : 28.4436721802

 

然后是中文标点:

re.sub(u”[\uFF00-\uFFEF]+” ,’ ‘,str.decode(‘utf8’))

 

参考地址:

http://stackoverflow.com/questions/265960/best-way-to-strip-punctuation-from-a-string-in-python

http://www.unicode.org/charts/PDF/UFF00.pdf

 

Python version 2.7 required, which was not found in the registry

安装setuptools的时候,不能再注册表中识别出来python2.7

在网上找了方法,仅作笔记,供下次使用

 

方法:

 

新建一个register.py 文件,把一下代码贴进去,保存

 

 

#
# script to register Python 2.0 or later for use with win32all
# and other extensions that require Python registry settings
#Python version 2.7 required, which was not found in the registry
# written by Joakim Loew for Secret Labs AB / PythonWare
#
# source:
# http://www.pythonware.com/products/works/articles/regpy20.htm
#
# modified by Valentine Gogichashvili as described in http://www.mail-archive.com/distutils-sig@python.org/msg10512.html
 
import sys
 
from _winreg import *
 
# tweak as necessary
version = sys.version[:3]
installpath = sys.prefix
 
regpath = "SOFTWARE\\Python\\Pythoncore\\%s\\" % (version)
installkey = "InstallPath"
pythonkey = "PythonPath"
pythonpath = "%s;%s\\Lib\\;%s\\DLLs\\" % (
    installpath, installpath, installpath
)
 
def RegisterPy():
    try:
        reg = OpenKey(HKEY_CURRENT_USER, regpath)
    except EnvironmentError as e:
        try:
            reg = CreateKey(HKEY_CURRENT_USER, regpath)
            SetValue(reg, installkey, REG_SZ, installpath)
            SetValue(reg, pythonkey, REG_SZ, pythonpath)
            CloseKey(reg)
        except:
            print "*** Unable to register!"
            return
        print "--- Python", version, "is now registered!"
        return
    if (QueryValue(reg, installkey) == installpath and
        QueryValue(reg, pythonkey) == pythonpath):
        CloseKey(reg)
        print "=== Python", version, "is already registered!"
        return
    CloseKey(reg)
    print "*** Unable to register!"
    print "*** You probably have another Python installation!"
 
if __name__ == "__main__":
    RegisterPy()

运行,显示“python 2.7 is already registered”

再安装setuptools的时候,就能自动识别出来python2.7了。

win7是 64的原因,在安装python(32位)时,如果选择只为当前用户,以上问题是不会出现的,如果选择所有用户,那就用上面的方法解决吧。