Title: | R Interface to Pullword Service |
---|---|
Description: | R interface to the Pullword service for natural language processing in Chinese. It enables users to extract valuable words from text using deep learning models. For more details, please visit the official site (in Chinese) <http://www.pullword.com/>. |
Authors: | Tong He <[email protected]> |
Maintainer: | Tong He <[email protected]> |
License: | LGPL |
Version: | 0.3 |
Built: | 2024-11-07 03:25:41 UTC |
Source: | https://github.com/hetong007/pullword |
Indicate whether the encoding of the input string is BIG5.
isBIG5(string, combine = FALSE)
string | A character vector. |
combine | Whether to combine all the strings. |
Logical value.
Jian Li <[email protected]>
Indicate whether the encoding of the input string is GB18030.
isGB18030(string, combine = FALSE)
string | A character vector. |
combine | Whether to combine all the strings. |
Logical value.
Jian Li <[email protected]>
Indicate whether the encoding of the input string is GB2312.
isGB2312(string, combine = FALSE)
string | A character vector. |
combine | Whether to combine all the strings. |
Logical value.
Jian Li <[email protected]>
Indicate whether the encoding of the input string is GBK.
isGBK(string, combine = FALSE)
string | A character vector. |
combine | Whether to combine all the strings. |
Logical value.
Jian Li <[email protected]>
Indicate whether the encoding of the input string is UTF-8.
isUTF8(string, combine = FALSE)
string | A character vector. |
combine | Whether to combine all the strings. |
Logical value.
Jian Li <[email protected]>
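For comparison, base R ships its own UTF-8 validity check. The sketch below is not the package's implementation: it relies on base R's `validUTF8()` and on a hypothetical helper, `is_utf8_sketch()`. Its `combine` argument is only assumed to mean "report a single value for all strings together", which is one plausible reading of the parameter described above.

```r
# Hypothetical helper for illustration; not part of the pullword package.
is_utf8_sketch <- function(string, combine = FALSE) {
  stopifnot(is.character(string))
  valid <- validUTF8(string)           # one logical value per element
  if (combine) all(valid) else valid   # assumed meaning of `combine`
}

# Two Chinese characters written as Unicode escapes, so they are stored as UTF-8.
utf8_text <- "\u4e2d\u6587"
# The same two characters as GBK bytes, built from raw values.
gbk_bytes <- rawToChar(as.raw(c(0xd6, 0xd0, 0xce, 0xc4)))

is_utf8_sketch(c(utf8_text, gbk_bytes))                  # TRUE FALSE
is_utf8_sketch(c(utf8_text, gbk_bytes), combine = TRUE)  # FALSE
```

A byte-level validity check like this can only say whether the bytes form well-formed UTF-8. It cannot tell GBK from BIG5 or GB18030, which is presumably why the package provides a separate function per encoding.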
This function handles communication with the server. The result is parsed into a vector or a matrix, depending on whether probabilities are shown.
pullword
pullword(input = NULL, file = NULL, threshold = 0, showProb = FALSE)
input | The input text. |
file | The input file. |
threshold | The minimum probability for the words appearing in the result. Should be a real value between 0 and 1. |
showProb | logical. The return value would be a
library(pullword)
pullword('Replace this field with a Chinese sentence.', threshold = 0, showProb = TRUE)
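To make the roles of `threshold` and `showProb` concrete, here is a minimal sketch of how a server reply could be filtered and shaped. It is an illustration under stated assumptions, not the package's code: the `word:probability` line format, the helper name `parse_pullword_sketch()`, and the sample reply are all hypothetical. The sketch also returns a data frame instead of a matrix when probabilities are requested, so that the probabilities stay numeric.

```r
# Hypothetical parser; the "word:probability" line format is an assumption,
# and every line is assumed to carry a probability.
parse_pullword_sketch <- function(response, threshold = 0, showProb = FALSE) {
  lines <- strsplit(response, "\r?\n")[[1]]
  lines <- trimws(lines)
  lines <- lines[nzchar(lines)]                      # drop blank lines
  parts <- strsplit(lines, ":", fixed = TRUE)
  words <- vapply(parts, `[`, character(1), 1)
  probs <- as.numeric(vapply(parts, `[`, character(1), 2))
  keep  <- probs >= threshold                        # apply the minimum probability
  if (showProb) {
    data.frame(word = words[keep], prob = probs[keep], stringsAsFactors = FALSE)
  } else {
    words[keep]
  }
}

# Made-up reply text, for illustration only.
fake_response <- "alpha:0.9\nbeta:0.4\ngamma:1\n"

parse_pullword_sketch(fake_response, threshold = 0.5)                   # "alpha" "gamma"
parse_pullword_sketch(fake_response, threshold = 0.5, showProb = TRUE)  # two-row data frame
```

Raising `threshold` toward 1 keeps only the words the service is most confident about, while `threshold = 0` keeps every candidate.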
Convert the encoding of a Chinese string to UTF-8.
toUTF8(cnstring)
cnstring | A character vector of Chinese strings. |
The converted character vector.
Jian Li <[email protected]>
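The kind of conversion described here can be sketched with base R alone. This is not how `toUTF8()` is implemented; it only shows the underlying idea using `iconv()`, with the source encoding supplied by hand. Encoding names accepted by `iconv()` are platform-dependent, so the availability of "GBK" is an assumption.

```r
# Two Chinese characters encoded as GBK, built from raw bytes so the
# example does not depend on the session locale.
gbk_bytes <- rawToChar(as.raw(c(0xd6, 0xd0, 0xce, 0xc4)))

# Convert with base R; here the caller must already know the source encoding.
converted <- iconv(gbk_bytes, from = "GBK", to = "UTF-8")

validUTF8(gbk_bytes)   # the GBK bytes are not well-formed UTF-8
validUTF8(converted)   # the converted string is
```

In this sketch the source encoding is hard-coded. A function that accepts strings in any of the encodings listed above would first have to detect the encoding, for example with checks like the `is*` functions documented earlier.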