react-native-japanese-tokenizer

Introduction: Async Japanese Tokenizer Native Plugin for React Native for iOS and Android
More: Author   ReportBugs   
Tags:

Async Japanese tokenizer native plugin for React Native that works on iOS and Android. It tokenizes Japanese text in background thread so it won't block UI. It comes in handy for such as indexing a lot of texts for full-text search.

screenshot

How it works

iOS

It utilizes CFStringTokenizer to tokenize strings into words.

Android

The implementation is based on TinySegmenter. It is lightweight word segmenter which works without dictionaries.

Getting started

$ npm install react-native-japanese-tokenizer --save

Mostly automatic installation

$ react-native link react-native-japanese-tokenizer

Manual installation

iOS

  1. In XCode, in the project navigator, right click LibrariesAdd Files to [your project's name]
  2. Go to node_modulesreact-native-japanese-tokenizer and add RNJapaneseTokenizer.xcodeproj
  3. In Xcode, in the project navigator, select your project. Add libRNJapaneseTokenizer.a to your project's Build PhasesLink Binary With Libraries
  4. Run your project (Cmd+R)

Android

  1. Open up android/app/src/main/java/[...]/MainActivity.java
    • Add import dog.craftz.japanese_tokenizer.RNJapaneseTokenizerPackage; to the imports at the top of the file
    • Add new RNJapaneseTokenizerPackage() to the list returned by the getPackages() method
  2. Append the following lines to android/settings.gradle:
      include ':react-native-japanese-tokenizer'
      project(':react-native-japanese-tokenizer').projectDir = new File(rootProject.projectDir,  '../node_modules/react-native-japanese-tokenizer/android')
    
  3. Insert the following lines inside the dependencies block in android/app/build.gradle:
       compile project(':react-native-japanese-tokenizer')
    

Usage

Using Promise:

import Tokenizer from 'react-native-japanese-tokenizer'

var text = "週休七日で働きたい"
Tokenizer.tokenize(text).then(tokens => {
  ...
})

async/await:

import Tokenizer from 'react-native-japanese-tokenizer'

var text = "週休七日で働きたい"
const tokens = await Tokenizer.tokenize(text)

Also check out the test to learn more.

License

MIT License. By Takuya Matsuyama (@craftzdog).

Note that this module is based on following libraries with respective license. Thank you for the fantastic works!

Apps
About Me
GitHub: Trinea
Facebook: Dev Tools