Thinking outside the box: Leveraging “UNIX Tools” (GNU) for Data Analysis and Validation

Life would be so much easier if everything was in a database or pulled via API. But that is not the case. All too often we get data files (or have to send them) in various formats. This session discusses some of the tools available to help you figure out what the file looks like so you can pull it apart using those tools or your tool-of-preference. While the GNU version of these tools will be the focus, the skills learned apply to many different platforms. Being available on so many platforms gives you lots of choices. Your choices include Microsoft’s Bash under Windows 10, Cygwin under many Microsoft Windows versions, MAC OSX, the Linux core of Android, commercial Linux — like Red Hat Enterprise, and commercial UNIX — like IBM’s AIX or Sun/Oracle’s Solaris.

Of particular interest are ‘head’, ‘tail’, ‘wc’, ‘awk’, ‘dd conv’, and shells.

A few of the differences between UNIX/Linux and Windows will also be discussed to ease your shifts in our heterogeneous environments. This knowledge also comes in handy if you need to migrate code from an existing UNIX/Linux-based application.

David Horvath

• David is an IT Professional who has worked with a wide variety of platforms over the years with a variety of development and analysis tools.
• This is not his first session and far from his first presentation – having presented workshops and seminars in Australia, France, the US, Canada, and Oxford England (about the British Author Nevil Shute).
• His undergraduate is in Computer and Information Sciences from Temple University and holds a Masters in Organizational Dynamics from UPENN. He holds the Certified Computing Professional designation.
• Most of his career has been in consulting (although recently he has been in house) in the Philadelphia PA area.
• He has several books to his credit (none directly Windows related) and is an Adjunct Instructor covering IT topics.