-
-
Notifications
You must be signed in to change notification settings - Fork 851
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
2 changed files
with
199 additions
and
11 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,184 @@ | ||
|
||
# PinYin for Han (汉字) | ||
|
||
--- | ||
|
||
[![NPM version][npm-badge]][npm-url] | ||
[![spm version][spm-badge]][spm-url] | ||
[![Build Status][travis-badge]][travis-url] | ||
[![Coverage Status][coveralls-badge]][coveralls-url] | ||
|
||
[npm-badge]: https://img.shields.io/npm/v/pinyin.svg?style=flat | ||
[npm-url]: https://www.npmjs.com/package/pinyin | ||
[spm-badge]: http://spmjs.io/badge/pinyin | ||
[spm-url]: http://spmjs.io/package/pinyin | ||
[travis-badge]: https://travis-ci.org/hotoo/pinyin.svg | ||
[travis-url]: https://travis-ci.org/hotoo/pinyin | ||
[coveralls-badge]: https://coveralls.io/repos/hotoo/pinyin/badge.svg?branch=master | ||
[coveralls-url]: https://coveralls.io/r/hotoo/pinyin | ||
|
||
[中文文档](README.md) | ||
|
||
Convert Han to pinyin. useful for phonetic notation, sorting, and searching. | ||
|
||
> Note: This module both support Node and Web browser. | ||
> | ||
> Python version see [mozillazg/python-pinyin](https://github.com/mozillazg/python-pinyin) | ||
--- | ||
|
||
## Feature | ||
|
||
* Segmentation for heteronym words. | ||
* Support Traditional and Simplified Chinese. | ||
* Support multiple pinyin style. | ||
|
||
## Install | ||
|
||
via npm: | ||
|
||
```bash | ||
npm install pinyin | ||
``` | ||
|
||
via spm: | ||
|
||
```bash | ||
spm install pinyin | ||
``` | ||
|
||
## Usage | ||
|
||
for developer: | ||
|
||
```js | ||
var pinyin = require("pinyin"); | ||
|
||
console.log(pinyin("中心")); // [ [ 'zhōng' ], [ 'xīn' ] ] | ||
console.log(pinyin("中心", { | ||
heteronym: true // Enable heteronym mode. | ||
})); // [ [ 'zhōng', 'zhòng' ], [ 'xīn' ] ] | ||
console.log(pinyin("中心", { | ||
heteronym: true, // Enable heteronym mode. | ||
segment: true // Enable Chinese words eegmentation, fix most heteronym problem. | ||
})); // [ [ 'zhōng' ], [ 'xīn' ] ] | ||
console.log(pinyin("中心", { | ||
style: pinyin.STYLE_INITIALS, // Setting pinyin style. | ||
heteronym: true | ||
})); // [ [ 'zh' ], [ 'x' ] ] | ||
``` | ||
|
||
for cli: | ||
|
||
```bash | ||
$ pinyin 中心 | ||
zhōng xīn | ||
$ pinyin -h | ||
``` | ||
|
||
## API | ||
|
||
### `<Array> pinyin(words[, options])` | ||
|
||
Convert Han (汉字) to pinyin. | ||
|
||
`options` argument is optional, for sepcify heteronym mode and pinyin styles. | ||
|
||
Return a `Array<Array<String>>`. If one of Han is heteronym word, it would be | ||
have multiple pinyin. | ||
|
||
## Options | ||
|
||
#### `<Boolean> options.segment` | ||
|
||
Enable Chinese word segmentation. Segmentation is helpful for fix heteronym | ||
problem, but performance will be more slow, and need more CPU and memory. | ||
|
||
Default is `false`. | ||
|
||
### `<Boolean> options.heteronym` | ||
|
||
Enable or disable heteronym mode. default is disabled, `false`. | ||
|
||
### `<Object> options.style` | ||
|
||
Specify pinyin style. please use static properties like `STYLE_*`. | ||
default is `.STYLE_TONE`. see Static Property for more. | ||
|
||
## Static Property | ||
|
||
### `.STYLE_NORMAL` | ||
|
||
Normal mode. | ||
|
||
Example: `pin yin` | ||
|
||
### `.STYLE_TONE` | ||
|
||
Tone style, this is default. | ||
|
||
Example: `pīn yīn` | ||
|
||
### `.STYLE_TONE2` | ||
|
||
Another tone style, use postfix number [0-4]. | ||
|
||
Example: `pin1 yin1` | ||
|
||
### `.STYLE_INITIALS` | ||
|
||
Initial consonant (of a Chinese syllable). | ||
|
||
Example: pinyin of `中国` is `zh g` | ||
|
||
Note: when a Han (汉字) without initial consonant, will convert to empty string. | ||
|
||
### `.STYLE_FIRST_LETTER` | ||
|
||
First letter style. | ||
|
||
Example: `p y` | ||
|
||
|
||
## Test | ||
|
||
``` | ||
npm test | ||
``` | ||
|
||
## Q&A | ||
|
||
### What's the different Node version and Web version? | ||
|
||
`pinyin` support Node and Web browser now, the API and usage is complete same. | ||
|
||
But the Web version is simple than Node version. Just frequently-used dict, | ||
without segmentation, and the dict is compress for web. | ||
|
||
Because of Traditional and Segmentation, the convert result will be not complete same. | ||
and the test case have some different too. | ||
|
||
| Feature | Web version | Node version | | ||
|--------------|---------------------------------|----------------------------------| | ||
| Dict | Frequently-used Dict, Compress. | Complete Dict, without Compress. | | ||
| Segmentation | NO | Segmentation options. | | ||
| Traditional | NO | Full Traditional support. | | ||
|
||
|
||
## Donate | ||
|
||
If this module is helpful for you, please Star this repository. | ||
|
||
And you have chioce donate to me via Aliapy or WeChat: | ||
|
||
![Alipay:hotoo.cn@gmail.com, WeChat:hotoome](http://hotoo.me/images/donate-hotoo.png) | ||
|
||
or donate my dear wife [@lizzie](https://github.com/lizzie) direct: | ||
|
||
![Alipay:shenyan1985@gmail.com, WeChat:SunsetSunrising](http://hotoo.me/images/donate-lizzie.png) | ||
|
||
The two donate way wile have the same result. | ||
|
||
## License | ||
|
||
[MIT](http://hotoo.mit-license.org/) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters