矢量化的字符串方法

来源:teadocs 浏览 372 扫码分享 2020-12-06 12:34:59

矢量化的字符串方法

矢量化的字符串方法

Series is equipped with a set of string processing methods that make it easy to operate on each element of the array. Perhaps most importantly, these methods exclude missing/NA values automatically. These are accessed via the Series’s str attribute and generally have names matching the equivalent (scalar) built-in string methods. For example:

In [305]: s = pd.Series(['A', 'B', 'C', 'Aaba', 'Baca', np.nan, 'CABA', 'dog', 'cat'])
In [306]: s.str.lower()
Out[306]: 
0       a
1       b
2       c
3    aaba
4    baca
5     NaN
6    caba
7     dog
8     cat
dtype: object

Powerful pattern-matching methods are provided as well, but note that pattern-matching generally uses regular expressions by default (and in some cases always uses them).

Please see Vectorized String Methods for a complete description.

当前内容版权归 teadocs 或其关联方所有，如需对内容或内容相关联开源项目进行关注与资助，请访问 teadocs .

本文档使用 BookStack 构建