Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Support
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in / Register
Toggle navigation
erp5
Project overview
Project overview
Details
Activity
Releases
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Issues
0
Issues
0
List
Boards
Labels
Milestones
Merge Requests
0
Merge Requests
0
Analytics
Analytics
Repository
Value Stream
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Create a new issue
Commits
Issue Boards
Open sidebar
Eugene Shen
erp5
Commits
3eb9c812
Commit
3eb9c812
authored
Sep 29, 2011
by
Julien Muchembled
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Python monkey-patch to speed up email parsing
parent
382916b8
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
56 additions
and
0 deletions
+56
-0
product/ERP5Type/patches/python.py
product/ERP5Type/patches/python.py
+56
-0
No files found.
product/ERP5Type/patches/python.py
View file @
3eb9c812
...
...
@@ -113,6 +113,62 @@ if sys.version_info < (2, 7):
collections
.
OrderedDict
=
OrderedDict
if
1
:
# Speed up email parsing (see also http://bugs.python.org/issue1243730)
from
email
import
Parser
as
parser
,
FeedParser
as
feedparser
# BBB
NLCRE_crack_split
=
feedparser
.
NLCRE_crack
.
split
def
push
(
self
,
data
):
"""Push some new data into this object."""
# <patch>
if
self
.
_partial
[
-
1
:]
==
'
\
r
'
:
parts
=
NLCRE_crack_split
(
'
\
r
'
+
data
)
parts
[
0
]
=
self
.
_partial
[:
-
1
]
else
:
parts
=
NLCRE_crack_split
(
data
)
parts
[
0
]
=
self
.
_partial
+
parts
[
0
]
# </patch>
# The *ahem* interesting behaviour of re.split when supplied grouping
# parentheses is that the last element of the resulting list is the
# data after the final RE. In the case of a NL/CR terminated string,
# this is the empty string.
self
.
_partial
=
parts
.
pop
()
#GAN 29Mar09 bugs 1555570, 1721862 Confusion at 8K boundary ending with \r:
# is there a \n to follow later?
if
not
self
.
_partial
and
parts
and
parts
[
-
1
].
endswith
(
'
\
r
'
):
self
.
_partial
=
parts
.
pop
(
-
2
)
+
parts
.
pop
()
# parts is a list of strings, alternating between the line contents
# and the eol character(s). Gather up a list of lines after
# re-attaching the newlines.
lines
=
[]
for
i
in
range
(
len
(
parts
)
//
2
):
lines
.
append
(
parts
[
i
*
2
]
+
parts
[
i
*
2
+
1
])
self
.
pushlines
(
lines
)
feedparser
.
BufferedSubFile
.
push
=
push
FeedParser
=
feedparser
.
FeedParser
def
parse
(
self
,
fp
,
headersonly
=
False
):
"""Create a message structure from the data in a file.
Reads all the data from the file and returns the root of the message
structure. Optional headersonly is a flag specifying whether to stop
parsing after reading the headers or not. The default is False,
meaning it parses the entire contents of the file.
"""
feedparser
=
FeedParser
(
self
.
_class
)
if
headersonly
:
feedparser
.
_set_headersonly
()
while
True
:
# <patch>
data
=
fp
.
read
(
65536
)
# </patch>
if
not
data
:
break
feedparser
.
feed
(
data
)
return
feedparser
.
close
()
parser
.
Parser
.
parse
=
parse
# Workaround bad use of getcwd() in docutils.
# Required by PortalTransforms.transforms.rest
from
docutils
import
utils
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment