- spaCy Tutorial
- spaCy - Home
- spaCy - Introduction
- spaCy - Getting Started
- spaCy - Models and Languages
- spaCy - Architecture
- spaCy - Command Line Helpers
- spaCy - Top-level Functions
- spaCy - Visualization Function
- spaCy - Utility Functions
- spaCy - Compatibility Functions
- spaCy - Containers
- Doc Class ContextManager and Property
- spaCy - Container Token Class
- spaCy - Token Properties
- spaCy - Container Span Class
- spaCy - Span Class Properties
- spaCy - Container Lexeme Class
- Training Neural Network Model
- Updating Neural Network Model
- spaCy Useful Resources
- spaCy - Quick Guide
- spaCy - Useful Resources
- spaCy - Discussion
spaCy - Util.compile_infix_regex
This utility function will compile a sequence of infix rules into a regex object.
Argument
The table below explains its argument −
NAME | TYPE | DESCRIPTION |
---|---|---|
entries | Tuple | This argument represents the infix rules. For example, lang.punctuation.TOKENIZER_INFIXES</>. |
Syntax
infixes = ("…", "-", "—", r"(?<=[0-9])[+-*^](?=[0-9-])") infix_reg = util.compile_infix_regex(infixes) nlp.tokenizer.infix_finditer = infix_reg.finditer
Example
import spacy nlp = spacy.load('en_core_web_sm') infixes = ('') infix_reg = spacy.util.compile_infix_regex(infixes) nlp.tokenizer.infix_finditer = infix_reg.finditer doc = nlp("[A] works for [B] in [C].") print([t.text for t in doc]) # ['[A]', 'works', 'for', '[B]', 'in', '[C]', '.']
Output
Output ['[', 'A', ']', 'w', 'o', 'r', 'k', 's', 'f', 'o', 'r', '[', 'B', ']', 'i', 'n', '[', 'C', ']', '.']
spacy_util_get_data_path.htm
Advertisements
To Continue Learning Please Login
Login with Google