[Chicago] Fwd: Message from contact form: Code for recognizing UTF-8 encoded inputt
Jordan Bettis
jordanb at hafd.org
Sun Apr 13 01:56:48 CEST 2014
To clarify, I didn't use input, the original poster did. I just added
the "decode".
On 04/12/2014 06:02 PM, Tim Saylor wrote:
> Jordan's code used input() to accept the word, but you probably want to
> use raw_input(). The former runs eval() on the word you entered. It's
> a weird user input quirk in python.
>
> Also, I don't believe you can really enter "unicode" characters. You'll
> enter unicode code points encoded in UTF-8 or Latin-1 or whatever your
> system is providing. That gets decoded from the system encoding, and
> then re-encoded into UTF-8 (or whatever you specify) when you write the
> file.
>
>
> On Thu, Apr 10, 2014 at 11:09 AM, Jordan Bettis <jordanb at hafd.org
> <mailto:jordanb at hafd.org>> wrote:
>
> This should work, although I didn't test it. ISO-8859-5 is Cyrillic. If
> you want to work with Latin1 you'll have to change it to ISO-8859-1:
>
>
> import os, sys, string, math, re, shutil, codecs
>
> f= open('testfile.txt', 'w', encoding='utf-8')
>
> inword = input("\nEnter the word to test: ")
> inword = inword.decode("iso-8859-5")
>
> print("=========================\n")
> if inword == "":
> print("\nYou just hit return; not a word\n")
> else:
> print("\nYour word was "+inword+"\n")
>
> f.write(inword)
> f.close
>
> endit = input("\nType enter to exit program")
>
>
>
>
> On 04/09/2014 03:39 PM, Robert Dean wrote:
> > Hello,
> >
> > This is a very embarrassing "baby" Python question. But a Python
> coding
> > friend of mine in AZ won't be able for consulting until over a
> month from
> > now.
> >
> > To summarize the garbled message below, would appreciate knowing
> > how to modify the attached testword1.py code so that it can both
> recognize
> > input unicode characters a n d output them correctly in a UTF-8
> encoded
> > file. As it stands: n o Cyrillic characters are recognized,
> only some
> > Greek
> > characters are, and Latin-1 accented characters are mostly (but
> not a l l)
> > recognized.
> >
> > Bob Dean
> > Chicago
> >
> > robertBdean at gmail.com <mailto:robertBdean at gmail.com>
> <mailto:robertBdean at gmail.com <mailto:robertBdean at gmail.com>>
> >
> >
> >
> > ---------- Forwarded message ----------
> > From: *Cezar Jenkins* <emperorcezar at gmail.com
> <mailto:emperorcezar at gmail.com>
> > <mailto:emperorcezar at gmail.com <mailto:emperorcezar at gmail.com>>>
> > Date: Wed, Apr 9, 2014 at 9:25 AM
> > Subject: Re: Message from contact form: Code for recognizing UTF-8
> > encoded inputt
> > To: robertbdean at gmail.com <mailto:robertbdean at gmail.com>
> <mailto:robertbdean at gmail.com <mailto:robertbdean at gmail.com>>
> > Cc: brianhray at gmail.com <mailto:brianhray at gmail.com>
> <mailto:brianhray at gmail.com <mailto:brianhray at gmail.com>>
> >
> >
> > Hello,
> >
> > The contact form only goes to a few Chipy organizers. To answer your
> > question, you’ll probably want to post to our mailing list. You
> can find
> > a link to it on our homepage.
> >
> > Thanks!
> >
> > --
> > Cezar Jenkins
> > @emperorcezar
> >
> > From: donotreply at chipy.org <mailto:donotreply at chipy.org>
> <mailto:donotreply at chipy.org <mailto:donotreply at chipy.org>>
> > donotreply at chipy.org <mailto:donotreply at chipy.org>
> <mailto:donotreply at chipy.org <mailto:donotreply at chipy.org>>
> > Reply: robertbdean at gmail.com <mailto:robertbdean at gmail.com>
> <mailto:robertbdean at gmail.com <mailto:robertbdean at gmail.com>>
> > robertbdean at gmail.com <mailto:robertbdean at gmail.com>
> <mailto:robertbdean at gmail.com <mailto:robertbdean at gmail.com>>
> > Date: April 9, 2014 at 12:39:02 AM
> > To: emperorcezar at gmail.com <mailto:emperorcezar at gmail.com>
> <mailto:emperorcezar at gmail.com <mailto:emperorcezar at gmail.com>>
> > emperorcezar at gmail.com <mailto:emperorcezar at gmail.com>
> <mailto:emperorcezar at gmail.com <mailto:emperorcezar at gmail.com>>,
> > brianhray at gmail.com <mailto:brianhray at gmail.com>
> <mailto:brianhray at gmail.com <mailto:brianhray at gmail.com>>
> brianhray at gmail.com <mailto:brianhray at gmail.com>
> > <mailto:brianhray at gmail.com <mailto:brianhray at gmail.com>>
> > Subject: Message from contact form: Code for recognizing UTF-8 encoded
> > inputt
> >
> >>
> >> Message from the contact form
> >> Sender: Robert Dean (robertbdean at gmail.com
> <mailto:robertbdean at gmail.com>
> >> <mailto:robertbdean at gmail.com <mailto:robertbdean at gmail.com>>)
> >> Category:
> >> The message follows.
> >>
> ================================================================================
> >>
> >> Hi,
> >>
> >> A friend of mine has been writing some Python code for me to
> >> "massage" output files from a stat. program we're
> >> maintaining. The issue is what lines to add to several Python
> programs
> >> so that they blow don't up when encountering unicode characters.
> >> I'm aware of the pieces of the required code to specify that an
> >> input file is UTF-8 encoded. B u t , I'm not a Python programmer
> >> myself. And my friend won't be available to help me until mid-May
> >> at the soonest. Could someone contact about a solution? THANKS. Bob
> >> Dean, Chicago
> >>
> ================================================================================
> >>
> >>
> >> --
> >> message sent with envelope - a contact form app for Django
> >
> >
> >
> > _______________________________________________
> > Chicago mailing list
> > Chicago at python.org <mailto:Chicago at python.org>
> > https://mail.python.org/mailman/listinfo/chicago
> >
>
> _______________________________________________
> Chicago mailing list
> Chicago at python.org <mailto:Chicago at python.org>
> https://mail.python.org/mailman/listinfo/chicago
>
>
>
>
> --
> @tsaylor
> http://www.timsaylor.com/
>
>
> _______________________________________________
> Chicago mailing list
> Chicago at python.org
> https://mail.python.org/mailman/listinfo/chicago
More information about the Chicago
mailing list