Beefy Boxes and Bandwidth Generously Provided by pair Networks
Pathologically Eclectic Rubbish Lister
 
PerlMonks  

Sending a UTF-8 (Unicode) E-mail

by monarch (Priest)
on Jun 18, 2008 at 17:03 UTC ( #692745=perlmeditation: print w/ replies, xml ) Need Help??

This is a brief summary of code that I've found to send UTF-8 Unicode e-mails that can be sent through SMTP servers that only support 7-bit encoding.

Creating UTF-8 Formatted E-mail Using MIME::Entity

Many SMTP servers will not permit 8-bit characters. UTF-8 encoding requires 8-bit support because the high bit of the first byte is used to indicate successive bytes forming part of the same character.

Assume you have some text containing unicode. For example:

my $data = "Goodbye in Japanese is \x{3055}\x{3088}\x{306A}\x{3089}.";

When creating an e-mail use the "quoted-printable" form of encoding for the data which will correctly encode the UTF-8 characters into 7-bit form. E-mail headers, however, must be encoded in the "MIME-Header" form if they contain any UTF-8 characters (which would often be in the case of foreign language subject headings).

The 7-bit version of the e-mail is obtained using the stringify function.

use Encode; use MIME::Entity; my $entity = MIME::Entity->build( Type => "text/plain", Charset => "UTF-8", Encoding => "quoted-printable", Data => Encode::encode( "UTF-8", $data ), From => Encode::encode( "MIME-Header", $sender ), To => Encode::encode( "MIME-Header", $recipient ), Subject => Encode::encode( "MIME-Header", $subject ), );

Sending the Encoded Message Using Net::SMTP

Next, it is a relatively trivial matter of sending the appropriately coded message to a SMTP server (using Net::SMTP):

use Net::SMTP; my $smtp = Net::SMTP->new( "localhost", Hello => 'localhost.localdomain', Timeout => 15, Debug => 1, ); $smtp->mail( $sender ); if ( $smtp->recipient( $recipient ) ) { $smtp->data(); my $msg = $entity->stringify; while ( $msg =~ m/([^\r\n]*)(\r\n|\n\r|\r|\n)?/g ) { my $line = ( defined($1) ? $1 : "" ) . "\r\n"; $smtp->datasend( $line ); } if ( $smtp->dataend() ) { print( STDERR, "Message sent!\n" ); } else { print( STDERR, "Failure sending data.\n" ); } } else { print( STDERR, "Rejected recipient.\n" ); } $smtp->quit();

See Also

Update: corrected formatting.

Comment on Sending a UTF-8 (Unicode) E-mail
Select or Download Code
Reaped: Re: Sending a UTF-8 (Unicode) E-mail
by NodeReaper (Curate) on Jul 26, 2008 at 19:01 UTC
Re: Sending a UTF-8 (Unicode) E-mail
by Anonymous Monk on Jun 16, 2014 at 02:11 UTC

    I tried to add a date and a reply-to header. The date worked but not the reply-to. In strict, the reply-to throws an error 500 (can't have "reply" as stand alone word) and in non-strict, it just doesn't add it.

    Do you know how to solve this issue? Are there more complete examples out there? So far, this page was the best to solve my accents problems in the headers. Utf-8 characters were not causing problems in the message body.

    thanks,

    Serge.

      Hi, I'm not fully up-to-date on the issue, but I did have this note in my archives, which I can't find the original source, but here it is:
      Non USASCII characters *in headers* require special encoding: 1) use Encode; $x="...."; $x_for_header = Encode::encode('MIME-Q', $x); MIME-Q is the best for mostly USASCII strings MIME-B is the best for mostly non USASCII strings 2) Add one extra header to *fully* declare body encoding: print MAIL "Content-Transfer-Encoding: 8bit\n";
      Hope it helps.

      I'm not really a human, but I play one on earth.
      Old Perl Programmer Haiku ................... flash japh

Log In?
Username:
Password:

What's my password?
Create A New User
Node Status?
node history
Node Type: perlmeditation [id://692745]
Front-paged by Arunbear
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others chilling in the Monastery: (11)
As of 2014-07-11 15:38 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    When choosing user names for websites, I prefer to use:








    Results (230 votes), past polls