pinyin

:cn: 汉字拼音 ➜ hàn zì pīn yīn

7,714

863

7,714

View on GitHub View on NPM

Top Related Projects

pinyin4j

1,288

A copy of http://sourceforge.net/projects/pinyin4j, then deploy it to maven central repository.

Quick Overview

Hotoo/pinyin is a JavaScript library for converting Chinese characters to their corresponding Pinyin (romanization) representation. It supports both simplified and traditional Chinese characters and offers various output options, including tones and diacritical marks.

Pros

Supports both simplified and traditional Chinese characters
Offers multiple output formats (with/without tones, with diacritical marks)
Provides customization options for handling polyphones and heteronyms
Lightweight and easy to integrate into web projects

Cons

Limited to Mandarin Chinese Pinyin (doesn't support other Chinese dialects)
May require additional dictionaries for accurate handling of less common characters
Occasional inaccuracies with complex phrases or names

Code Examples

Basic usage:

import pinyin from 'pinyin';

console.log(pinyin('中文'));
// Output: [ [ 'zhong' ], [ 'wen' ] ]

Using tone numbers:

console.log(pinyin('中文', { style: pinyin.STYLE_TONE2 }));
// Output: [ [ 'zhong1' ], [ 'wen2' ] ]

Using diacritical marks:

console.log(pinyin('中文', { style: pinyin.STYLE_NORMAL }));
// Output: [ [ 'zhōng' ], [ 'wén' ] ]

Customizing heteronym handling:

console.log(pinyin('省长', {
  heteronym: true,
  segment: true
}));
// Output: [ [ 'shěng', 'xǐng' ], [ 'zhǎng', 'cháng' ] ]

Getting Started

To use hotoo/pinyin in your project, follow these steps:

Install the package:
```
npm install pinyin
```

Import and use in your JavaScript code:

import pinyin from 'pinyin';

const result = pinyin('你好，世界');
console.log(result);
// Output: [ [ 'ni' ], [ 'hao' ], [ 'shi' ], [ 'jie' ] ]

Customize options as needed:

const result = pinyin('你好，世界', {
  style: pinyin.STYLE_TONE2,
  heteronym: true
});
console.log(result);
// Output: [ [ 'ni3' ], [ 'hao3' ], [ 'shi4' ], [ 'jie4' ] ]

Competitor Comparisons

python-pinyin

5,169

汉字转拼音(pypinyin)

Pros of python-pinyin

Written in Python, making it more accessible for Python developers
Supports multiple pinyin styles (e.g., NORMAL, TONE, TONE2, INITIALS, FIRST_LETTER)
Provides flexible customization options for handling non-Chinese characters

Cons of python-pinyin

Limited to Python environments, unlike pinyin which is JavaScript-based
May have slower performance compared to pinyin due to language differences
Less extensive documentation and examples compared to pinyin

Code Comparison

python-pinyin:

from pypinyin import pinyin, Style
print(pinyin('中心', style=Style.TONE))
# Output: [['zhōng'], ['xīn']]

pinyin:

const pinyin = require("pinyin");
console.log(pinyin('中心', {style: pinyin.STYLE_TONE}));
// Output: [['zhōng'], ['xīn']]

Both libraries offer similar functionality for converting Chinese characters to pinyin, with slight differences in syntax and available options. The python-pinyin library provides more granular control over pinyin styles, while pinyin offers a simpler API with fewer configuration options.

The choice between these libraries largely depends on the programming language preference and specific project requirements. Python developers may find python-pinyin more convenient, while JavaScript developers might prefer pinyin for its integration with Node.js and browser environments.

pinyin4j

1,288

A copy of http://sourceforge.net/projects/pinyin4j, then deploy it to maven central repository.

Pros of pinyin4j

Written in Java, offering better integration with Java-based projects
Provides more extensive tone mark and number notation options
Includes support for multiple pinyin systems (e.g., Wadegiles, Yale)

Cons of pinyin4j

Less frequently updated compared to pinyin
Requires more setup and configuration for non-Java projects
Limited customization options for output formats

Code Comparison

pinyin4j:

PinyinHelper.toHanyuPinyinStringArray('中')[0]; // "zhong"

pinyin:

pinyin('中')[0][0]; // "zhong"

Additional Notes

Both libraries aim to convert Chinese characters to pinyin, but they cater to different ecosystems. pinyin4j is more suitable for Java developers and offers more comprehensive pinyin notation options. On the other hand, pinyin is a JavaScript library that integrates more easily with web-based projects and has a simpler API.

The choice between these libraries largely depends on the programming language of your project and the specific pinyin conversion requirements you have. Consider factors such as ease of integration, performance, and available features when making your decision.

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

README

pÄ«nyÄ«n (v4)

pÄ«nyÄ«n, æ±åæ¼é³è½¬æ¢å·¥å·ã

ç®ä½ä¸æ | English | íêµì´

è½¬æ¢ä¸æåç¬¦ä¸ºæ¼é³ãå¯ä»¥ç¨äºæ±åæ³¨é³ãæåºãæ£ç´¢ã

æ³¨ï¼è¿ä¸ªçæ¬åæ¶æ¯æå¨ Node å Web æµè§å¨ç¯å¢è¿è¡ï¼

Python çè¯·å³æ³¨ mozillazg/python-pinyin

ç¹æ§

æ ¹æ®è¯ç»æºè½å¹éææ£ç¡®çæ¼é³ã
æ¯æå¤é³åã
ç®åçç¹ä½æ¯æã
æ¯æå¤ç§ä¸åæ¼é³é£æ ¼ã

å®è£

via npm:

npm install pinyin --save

ç¨æ³

å¼åèï¼

import pinyin from "pinyin";

console.log(pinyin("ä¸å¿"));    // [ [ 'zhÅng' ], [ 'xÄ«n' ] ]

console.log(pinyin("ä¸å¿", {
  heteronym: true,              // å¯ç¨å¤é³åæ¨¡å¼
}));                            // [ [ 'zhÅng', 'zhÃ²ng' ], [ 'xÄ«n' ] ]

console.log(pinyin("ä¸å¿", {
  heteronym: true,              // å¯ç¨å¤é³åæ¨¡å¼
  segment: true,                // å¯ç¨åè¯ï¼ä»¥è§£å³å¤é³åé®é¢ãé»è®¤ä¸å¼å¯ï¼ä½¿ç¨ true å¼å¯ä½¿ç¨ Intl.Segmenter åè¯åºã
}));                            // [ [ 'zhÅng' ], [ 'xÄ«n' ] ]

console.log(pinyin("ä¸å¿", {
  segment: "@node-rs/jieba",    // æå®åè¯åºï¼å¯ä»¥æ¯ "Intl.Segmenter", "nodejieba"ã"segmentit"ã"@node-rs/jieba"ã
}));                            // [ [ 'zhÅng' ], [ 'xÄ«n' ] ]

console.log(pinyin("æåæ¬¢ä½ ", {
  segment: "segmentit",         // å¯ç¨åè¯
  group: true,                  // å¯ç¨è¯ç»
}));                            // [ [ 'wÇ' ], [ 'xÇhuÄn' ], [ 'nÇ' ] ]

console.log(pinyin("ä¸å¿", {
  style: "initials",            // è®¾ç½®æ¼é³é£æ ¼ã
  heteronym: true,              // å³ä½¿æå¤é³åï¼å ä¸ºæ¼é³é£æ ¼éæ©ï¼éå¤çä¹ä¼åå¹¶ã
}));                            // [ [ 'zh' ], [ 'x' ] ]

console.log(pinyin("åå¤«äºº", {
  mode: "surname",              // å§åæ¨¡å¼ã
}));                            // [ ['huÃ '], ['fÅ«'], ['rÃ©n'] ]

å½ä»¤è¡ï¼

$ pinyin ä¸å¿
zhÅng xÄ«n
$ pinyin -h

ç±»å

IPinyinOptions

ä¼ å¥ç» pinyin æ¹æ³çç¬¬äºä¸ªåæ°çéé¡¹ç±»åã

export interface IPinyinOptions {
  style?: IPinyinStyle; // æ¼é³è¾åºå½¢å¼
  mode?: IPinyinMode, // æ¼é³æ¨¡å¼
  // æå®åè¯åºã
  // ä¸ºäºå¼å®¹èçæ¬ï¼å¯ä»¥ä½¿ç¨ boolean ç±»åæå®æ¯å¦å¼å¯åè¯ï¼é»è®¤å¼å¯ã
  segment?: IPinyinSegment | boolean;
  // æ¯å¦è¿åå¤é³å
  heteronym?: boolean;
  // æ¯å¦åç»è¯ç»æ¼é³
  group?: boolean;
  compact?: boolean;
}

IPinyinStyle

export type IPinyinStyle =
  "normal" | "tone" | "tone2" | "to3ne" | "initials" | "first_letter" | "passport" | // æ¨èä½¿ç¨å°åï¼åè¾åºçæ¼é³ä¸è´
  "NORMAL" | "TONE" | "TONE2" | "TO3NE" | "INITIALS" | "FIRST_LETTER" | "PASSPORT" | // æ¹ä¾¿èçæ¬è¿ç§»
  0        | 1      | 2       | 5       | 3          | 4;               // å¼å®¹èçæ¬

IPinyinMode

æ¼é³æ¨¡å¼ï¼é»è®¤æ®éæ¨¡å¼ï¼å¯ä»¥æå®äººåæ¨¡å¼ã

// - NORMAL: æ®éæ¨¡å¼
// - SURNAME: å§æ°æ¨¡å¼ï¼ä¼åä½¿ç¨å§æ°çæ¼é³ã
export type IPinyinMode =
  "normal" | "surname" |
  "NORMAL" | "SURNAME";

IPinyinSegment

åè¯æ¹å¼ã

é»è®¤å³é falseï¼
ä¹å¯ä»¥è®¾ç½®ä¸º true å¼å¯ï¼Web å Node çä¸åä½¿ç¨ "Intl.Segmenter" åè¯ã
ä¹å¯ä»¥å£°æä»¥ä¸åç¬¦ä¸²æ¥æå®åè¯ç®æ³ãä½ç®å Web çåªæ¯æ "Intl.Segmenter" å "segmentit" åè¯ã

export type IPinyinSegment = "Intl.Segmenter" | "nodejieba" | "segmentit" | "@node-rs/jieba";

API

æ¹æ³ `<Array> pinyin(words: string[, options: IPinyinOptions])`

å°ä¼ å¥çä¸æåç¬¦ä¸² (words) è½¬æ¢ææ¼é³ç¬¦å·ä¸²ã

æ¹æ³ `Number compare(a, b)`

ææ¼é³æåºçé»è®¤ç®æ³ã

æ¹æ³ `string[][] compact(pinyinResult array[][])`

å°æ¼é³å¤é³åä»¥åç§å¯è½çç»åæååæ¢æç´§åå½¢å¼ãåè options.compact

åæ°

`<Boolean|IPinyinSegment> options.segment`

é»è®¤ä¸å¯ç¨åè¯ã
å¦æ segemnt = trueï¼é»è®¤ä½¿ç¨ Intl.Segmenter åè¯ã
å¯ä»¥æå® "Intl.Segmenter"ã"nodejieba"ã"segmentit"ã"@node-rs/jieba" è¿è¡åè¯ã

`<Boolean> options.heteronym`

æ¯å¦å¯ç¨å¤é³åæ¨¡å¼ï¼é»è®¤å³éã

å³éå¤é³åæ¨¡å¼æ¶ï¼è¿åæ¯ä¸ªæ±åç¬¬ä¸ä¸ªå¹éçæ¼é³ã

å¯ç¨å¤é³åæ¨¡å¼æ¶ï¼è¿åå¤é³åçæææ¼é³åè¡¨ã

`<Boolean> options.group`

æè¯ç»åç»æ¼é³ï¼ä¾å¦ï¼

æåæ¬¢ä½ 
wÇ xÇhuÄn nÇ

`<IPinyinStyle> options.style`

æå®æ¼é³é£æ ¼ãå¯ä»¥ä½¿ç¨ä»¥ä¸ç¹å®åç¬¦ä¸²ææ°å¼æå®ï¼

IPinyinStyle =
  "normal" | "tone" | "tone2" | "to3ne" | "initials" | "first_letter" | "passport" | // æ¨èä½¿ç¨å°åï¼åè¾åºçæ¼é³ä¸è´
  "NORMAL" | "TONE" | "TONE2" | "TO3NE" | "INITIALS" | "FIRST_LETTER" | "PASSPORT" | // æ¹ä¾¿èçæ¬è¿ç§»
  0        | 1      | 2       | 5       | 3          | 4;               // å¼å®¹èçæ¬

`NORMAL`, `normal`

æ®éé£æ ¼ï¼å³ä¸å¸¦å£°è°ã

å¦ï¼pin yin

`TONE`, `tone`

å£°è°é£æ ¼ï¼æ¼é³å£°è°å¨éµæ¯ç¬¬ä¸ä¸ªåæ¯ä¸ã

æ³¨ï¼è¿æ¯é»è®¤çé£æ ¼ã

å¦ï¼pÄ«n yÄ«n

`TONE2`, `tone2`

å¦ï¼pin1 yin1

`TO3NE`, `to3ne`

å¦ï¼pi1n yi1n

`INITIALS`, `initials`

å¦ï¼ä¸å½ çæ¼é³ zh g

æ³¨ï¼å£°æ¯é£æ ¼ä¼åºå zh å zï¼ch å cï¼sh å sã

`FIRST_LETTER`, `first_letter`

é¦åæ¯é£æ ¼ï¼åªè¿åæ¼é³çé¦åæ¯é¨åã

å¦ï¼p y

`PASSPORT`, `passport`

options.mode

NORMALï¼æ®éæ¨¡å¼ï¼èªå¨è¯å«è¯»é³ã
SURNAMEï¼å§åæ¨¡å¼ï¼å¯¹äºæç¡®çå§ååºæ¯ï¼å¯ä»¥æ´åç¡®çè¯å«å§æ°çè¯»é³ã

options.compact

pinyin("ä½ å¥½å", { compact:false });
> [[nÇ], [hÇo,hÃ o], [ma,mÃ¡,mÇ]]

pinyin("ä½ å¥½å", { compact:true });
> [
>   [nÇ,hÇo,ma], [nÇ,hÇo,mÃ¡], [nÇ,hÇo,mÇ],
>   [nÇ,hÃ o,ma], [nÇ,hÃ o,mÃ¡], [nÇ,hÃ o,mÇ],
> ]

ä½ ä¹å¯ä»¥å¿è¦æ¶ä½¿ç¨ compact() å½æ°å¤ç pinyin(han, {compact:false}) è¿åçç»æã

Test

npm test

Q&A

å³äº Web çå¦ä½ä½¿ç¨

å®å¨ä¸æ³æè¾ï¼å¯ä»¥è¯è¯ https://www.jsdelivr.com/package/npm/pinyin

ä¸ºä»ä¹æ²¡æ `y`, `w`, `yu` å ä¸ªå£°æ¯ï¼

å¦ä½å®ç°ææ¼é³æåºï¼

pinyin æ¨¡åæä¾äºé»è®¤çæåºæ¹æ¡ï¼

const pinyin = require('pinyin');

const data = 'æè¦æåº'.split('');
const sortedData = data.sort(pinyin.compare);

const pinyin = require('pinyin');

const data = 'æè¦æåº'.split('');

// å»ºè®®å°æ±åçæ¼é³æä¹ååå¨èµ·æ¥ã
const pinyinData = data.map(han => ({
  han: han,
  pinyin: pinyin(han)[0][0], // å¯ä»¥èªè¡éæ©ä¸åççææ¼é³æ¹æ¡åé£æ ¼ã
}));
const sortedData = pinyinData.sort((a, b) => {
  return a.pinyin.localeCompare(b.pinyin);
}).map(d => d.han);

node çå web çæä»ä¹å¼åï¼

ç±äºåè¯åç¹ä½ä¸æçç¹æ§ï¼é¨åæåµä¸çç»æä¹ä¸å°½ç¸åã

ç¹æ§	Web ç	Node ç
æ¼é³åº	å¸¸ç¨ååºãåç¼©ãåå¹¶	å®æ´ååºãä¸åç¼©ãåå¹¶
åè¯	æ²¡æåè¯	ä½¿ç¨åè¯ç®æ³ï¼å¤é³åæ¼é³æ´åç¡®ã
æ¼é³é¢åº¦æåº	ææ ¹æ®æ¼é³ä½¿ç¨é¢åº¦ä¼åçº§æåºã	å Web çã
ç¹ä½ä¸æ	æ²¡æç¹ä½ä¸ææ¯æã	æç®åçç¹ç®æ±åè½¬æ¢ã

ç±äºè¿äºåºå«ï¼æµè¯ä¸åè¿è¡ç¯å¢çç¨ä¾ä¹ä¸å°½ç¸åã

æèµ

Alipay:hotoo.cn@gmail.com, WeChat:hotoome

è®¸å¯è¯

MIT

Top Related Projects

python-pinyin

5,169

汉字转拼音(pypinyin)

pinyin4j

1,288

A copy of http://sourceforge.net/projects/pinyin4j, then deploy it to maven central repository.

Convert designs to code with AI

Introducing Visual Copilot: A new AI model to turn Figma designs to high quality code using your components.

Try Visual Copilot

pinyin

Top Related Projects

python-pinyin

pinyin4j

Quick Overview

Pros

Cons

Code Examples

Getting Started

Competitor Comparisons

python-pinyin

Pros of python-pinyin

Cons of python-pinyin

Code Comparison

pinyin4j

Pros of pinyin4j

Cons of pinyin4j

Code Comparison

Additional Notes

Convert designs to code with AI

README

pÄ«nyÄ«n (v4)

ç¹æ§

å®è£

ç¨æ³

ç±»å

IPinyinOptions

IPinyinStyle

IPinyinMode

IPinyinSegment

API

æ¹æ³ <Array> pinyin(words: string[, options: IPinyinOptions])

æ¹æ³ Number compare(a, b)

æ¹æ³ string[][] compact(pinyinResult array[][])

åæ°

<Boolean|IPinyinSegment> options.segment

<Boolean> options.heteronym

<Boolean> options.group

<IPinyinStyle> options.style

NORMAL, normal

TONE, tone

TONE2, tone2

TO3NE, to3ne

INITIALS, initials

FIRST_LETTER, first_letter

PASSPORT, passport

options.mode

options.compact

Test

Q&A

å ³äº Web çå¦ä½ä½¿ç¨

ä¸ºä»ä¹æ²¡æ y, w, yu å ä¸ªå£°æ¯ï¼

å¦ä½å®ç°ææ¼é³æåºï¼

node çå web çæä»ä¹å¼åï¼

æèµ

è®¸å¯è¯

Top Related Projects

python-pinyin

pinyin4j

Convert designs to code with AI

NPM DownloadsLast 30 Days

ç¹æ§

å®è£

ç¨æ³

ç±»å

æ¹æ³ `<Array> pinyin(words: string[, options: IPinyinOptions])`

æ¹æ³ `Number compare(a, b)`

æ¹æ³ `string[][] compact(pinyinResult array[][])`

åæ°

`<Boolean|IPinyinSegment> options.segment`

`<Boolean> options.heteronym`

`<Boolean> options.group`

`<IPinyinStyle> options.style`

`NORMAL`, `normal`

`TONE`, `tone`

`TONE2`, `tone2`

`TO3NE`, `to3ne`

`INITIALS`, `initials`

`FIRST_LETTER`, `first_letter`

`PASSPORT`, `passport`

å³äº Web çå¦ä½ä½¿ç¨

ä¸ºä»ä¹æ²¡æ `y`, `w`, `yu` å ä¸ªå£°æ¯ï¼

å¦ä½å®ç°ææ¼é³æåºï¼

node çå web çæä»ä¹å¼åï¼

æèµ

è®¸å¯è¯