Document: ezcDocumentPdfDefaultTokenizer
[ ]
[ Conversion ] [ Styles ]
[ ]
[ ]
[ ]
[ ]
Class: ezcDocumentPdfDefaultTokenizer
|
Tokenizer implementation for common texts, using whitespaces as word seperators. [
source]
Parents
ezcDocumentPdfTokenizer
|
--ezcDocumentPdfDefaultTokenizer
Inherited Constants
From
ezcDocumentPdfTokenizer:
Method Summary
|
public array |
tokenize(
$string )
Split string into words |
Inherited Methods
From
ezcDocumentPdfTokenizer :
Methods
tokenize
array tokenize(
string
$string )
Split string into words
This function takes a string and splits it into words. There are different mechanisms which indicate possible splitting points in the resulting word stream:
- self:SPACE: The renderer might render a space
- self:WRAP: The renderer might wrap the line at this position, but will
not render spaces.
A possible splitting of an english sentence might look like:
1. array(
2. 'Hello',
3. self:SPACE,
4. 'world!',
5. );
Non breaking spaces should not be splitted into multiple words, so there will be no break applied.
Parameters
| Name |
Type |
Description |
$string |
string |
|
Redefinition of
Last updated: Tue, 01 Sep 2009