qdapRegex: qdapRegex v. 0.1.3
Description
BUG FIXES
pastex would throw a warning on a vector (e.g., pastex(letters)). This has been fixed.
youtube_id was documented under qdap_usa rather than qdap_supplement and contained an invalid hyperlink. This has been fixed.
rm_citation failed on multi-author citations that contain a comma before the and/& sign. See issue #4.
NEW FEATURES
is.regex added as a logical check of a regular expression's validity (whether it conforms to R's regular expression rules).
rm_postal_code added for removing/extracting/replacing U.S. postal codes.
Case wrapper functions, TC (title case), U (upper case), and L (lower case) added for convenient case manipulation.
group function added to allow for convenient wrapping of grouping parentheses around regular expressions.
rm_citation_tex added to remove/extract/replace bibkey citations from a .tex (LaTeX) file.
regex_cheat data set and cheat function added to act as a quick reference for common regex tasks such as lookaheads.
rm_caps_phrase added to supplement rm_caps, extending the search to phrases.
explain added to view a visual representation of a regular expression using http://www.regexper.com and http://rick.measham.id.au/paste/explain. Also takes named regular expressions from the regex_usa or other supplied dictionary.
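The validity check and the grouping wrapper described above can be illustrated with a minimal base R sketch. The helpers is_valid_regex and wrap_group are hypothetical names chosen for this example; they are not the package's is.regex or group functions and may behave differently from them.

```r
# Hypothetical helper: TRUE if `pattern` compiles as a regular expression.
# It tries to use the pattern and treats any error as "invalid".
is_valid_regex <- function(pattern) {
  result <- tryCatch(
    suppressWarnings(grepl(pattern, "", perl = TRUE)),
    error = function(e) NULL
  )
  !is.null(result)
}

# Hypothetical helper: wrap each expression in grouping parentheses.
wrap_group <- function(...) paste0("(", c(...), ")")

is_valid_regex("(a|b")    # unbalanced parenthesis, so FALSE
is_valid_regex("(a|b)")   # TRUE
wrap_group("a|b", "\\d+") # "(a|b)" "(\\d+)"
```

Wrapping before combining matters because alternation binds loosely: "a|b" followed by another expression changes meaning unless it is grouped first.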
MINOR FEATURES
last_occurrence regex added to the regex_supplement dictionary to find the last occurrence of a delimiter.
word_boundary, word_boundary_left, and word_boundary_right added to regex_supplement dictionary to provide a true word boundary. Regexes adapted from: http://www.rexegg.com/regex-boundaries.html#real-word-boundary
rm_time2 regex added to the regex_usa dictionary to find a time followed by AM/PM.
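One common way to find the last occurrence of a delimiter is a negative lookahead, sketched below in base R. The pattern shown is an assumption made for illustration and is not necessarily the expression stored in the regex_supplement dictionary.

```r
x <- "alpha_beta_gamma"

# Match an underscore that is not followed by any later underscore.
# Lookaheads need perl = TRUE in base R.
last_delim <- "_(?!.*_)"

pos <- regexpr(last_delim, x, perl = TRUE)
last_pos <- as.integer(pos)        # 11
tail_part <- substring(x, last_pos + 1)  # "gamma"
```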
IMPROVEMENTS
The regex_usa dictionary regular expressions rm_hash, rm_tag, rm_tag2, and rm_between now include grouping that allows for replacement of individual sections of the matched substring. See ?rm_hash and ?rm_tag for examples.
pastex picks up a sep argument to allow the user to choose what string is used to separate the collapsed expressions.
rm_citation, rm_citation2, and rm_citation3 now attempt to include last names that contain the lower case particles: von, van, de, da, and du.
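The first two improvements can be pictured with base R alone. In the sketch below, hash_pattern is a hypothetical hashtag expression written for this example, not the dictionary's rm_hash entry, and paste with collapse stands in for the idea behind the sep argument rather than calling pastex itself.

```r
# Hypothetical hashtag pattern with two capture groups:
# group 1 is the "#" marker, group 2 is the tag text.
hash_pattern <- "(#)(\\w+)"
x <- "I love #rstats and #regex"

# Grouping lets the replacement refer to one section of the match:
# keep only the tag text (group 2) and drop the marker.
tags_only <- gsub(hash_pattern, "\\2", x, perl = TRUE)
# "I love rstats and regex"

# Collapsing several expressions with a chosen separator string.
joined <- paste(c("cat", "dog"), collapse = "|")
# "cat|dog"
```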
Files
| Name | Size | MD5 |
|---|---|---|
| qdapRegex-qdapRegex_v_0.1.3.zip | 381.9 kB | c66a19a48f70a71aa8dbf3fd0301fb50 |
Additional details
Related works
- Is supplement to: https://github.com/trinker/qdapRegex/tree/qdapRegex_v_0.1.3 (URL)