- <li><em>date</em>: Thu Jul 7 10:11:30 2005</li>
- <li><em>from</em>: jimmyoliver at gmail.com (Jimmy Oliver)</li>
- <li><em>in-reply-to</em>: <<a href="msg00078.html">[email protected]</a>></li>
- <li><em>references</em>: <<a href="msg00078.html">[email protected]</a>></li>
- <li><em>subject</em>: [ale] Parsing CSV file in perl</li>
I usually convert tab-separated CSVs to actual comma-delimited CSVs,
and then I use Perl's split function.
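
A minimal sketch of the split approach, under a few assumptions: it
splits on the tab character directly instead of converting to commas
first, because a field such as "Air Products and Chemicals, Inc" in the
sample below already contains a comma and would be cut in two by a comma
split. The helper name parse_line and the two sample rows are
hypothetical, written only for illustration. A limit of -1 makes split
keep empty fields, so a missing ID still occupies the first position.

```perl
use strict;
use warnings;

# Split one tab-delimited record into fields, keeping empty ones.
# The -1 limit stops split from discarding trailing empty fields.
sub parse_line {
    my ($line) = @_;
    $line =~ s/\r?\n\z//;                 # strip the line ending
    my @fields = split /\t/, $line, -1;
    s/^"(.*)"$/$1/ for @fields;           # drop surrounding double quotes, if any
    return @fields;
}

# Hypothetical rows modelled on the sample data in the quoted message:
# the first one has no ID, so it starts with a tab.
my @rows = (
    "\t\"Adams\"\t\"Portia\"\n",
    "10572\t\"Alexander\"\t\"Robert\"\n",
);

for my $row (@rows) {
    my ($id, $last, $first) = parse_line($row);
    my $shown_id = $id eq '' ? '(none)' : $id;
    print "id=$shown_id last=$last first=$first\n";
}
```

This does not handle tabs or line breaks embedded inside quoted fields.
If the data can contain those, a CSV module such as Text::CSV from CPAN
(with its sep_char option set to a tab) is probably the safer route.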
> Here is a sample piece of data.
>
> Id LASTNAME FIRSTNAME
> Adams Portia
> 10572 Alexander Robert
>
> You can see that the first row does not have an ID. This can be true
> for all columns. They may or may not have values.
>
> Here is how I'm trying to parse it:
>
> open TXT, "< Expanded_2005_Select_1.csv";
> while(<TXT>) {
> m/^(\d+?)\t/;
> print "$1\n";
> }
>
> Each column is tab delimited. When I run this I get the lastname in $1
> for the first line and the ID in $1 for the second line. I need to
> somehow create a regex that would be unforgiving of nothing being there.
>
> Data file looks like this:
> 1 "ID" "LASTNAME" "FIRSTNAME" "TITLE" "COMPANY"
> "ADDRESS " "ADDRESS2" "CITY" "STATE" "ZIPCODE"
> "COUNTRY" "PHONE" "EMAIL" "REGTYPE" "DATE" "TIME"
> "Question1" "Question2" "Question3" "READERID"
> 2 "Adams" "Portia" "Director" "The Rockefeller
> University" "1230 York Ave " "New York"
> "NY" "10021-6399" "USA" 2123277719
> "adams at rockefeller.edu" "Member"
> 3 10572 "Alexander" "Robert" "Manager Voice & Video
> Solution" "Air Products and Chemicals, Inc" "7201
> Hamilton Blvd" "Allentown" "PA" "18195-1501"
> "USA" "610-481-7156" "alexanrw at airproducts.com" "Member"
> 06/12/2005 06:06:14 pm 60711
>
> The 1, 2, 3 that you see are the line numbers in vi.
>
>
--
________________________
Jimmy Oliver
<a rel="nofollow" href="http://www.jimmyoliver.net">http://www.jimmyoliver.net</a>
email: jimmy at jimmyoliver.net
lists: jimmyoliver at gmail.com
</pre>
<!--X-Body-of-Message-End-->
<!--X-MsgBody-End-->
<!--X-Follow-Ups-->
<hr>
<!--X-Follow-Ups-End-->
<!--X-References-->
<ul><li><strong>References</strong>:
<ul>
<li><strong><a name="00078" href="msg00078.html">[ale] Parsing CSV file in perl</a></strong>
<ul><li><em>From:</em> cfowler at outpostsentinel.com (Christopher Fowler)</li></ul></li>
</ul></li></ul>
<!--X-References-End-->
</body>
</html>