Leveraging “UNIX Tools” (GNU) for Data Analysis

Life would be so much easier if everything was in a database or pulled via API. But that is not the case. All too often we get data files (or have to send them) in various formats. This session discusses some of the tools available to help you figure out what the file looks like so you can pull it apart using those tools or your tool-of-preference. While the GNU version of these tools will be the focus, the skills learned apply to many different platforms (Microsoft’s Bash under Windows 10, Cygwin under Microsoft Windows, MAC OSX, the Linux core of Android, commercial Linux — like Red Hat Enterprise, and commercial UNIX — like IBM’s AIX or Sun/Oracle’s Solaris).

Of particular interest are ‘head’, ‘tail’, ‘wc’, ‘awk’, ‘dd conv’, and shells.

A few of the differences between UNIX/Linux and Windows will also be discussed in case you ever have to deal with those environments in our heterogeneous environments. This knowledge also comes in handy if you need to migrate code from an existing UNIX/Linux-based application.

David Horvath

David is an IT Professional who has worked with various platforms since the 1980’s with a variety of development and analysis tools.
This is not his first philly.net session and far from his first presentation – having presented workshops and seminars in Australia, France, the US, Canada, and Oxford England (about the British Author Nevil Shute).

His undergraduate is in Computer and Information Sciences from Temple University and holds a Masters is Organizational Dynamics from UPENN. He holds the Certified Computing Professional designation.

Most of his career has been in consulting (although recently he has been in house) in the Philadelphia PA area.

He has several books to his credit (none directly Windows related) and is an Adjunct Instructor covering IT topics.