Posts in Category: SSIS

DenverSQL UG Meeting: Tim Mitchell presents Dirty Data? Clean it up! 

 Denver SQL Server User Group - Thursday, February 17, 2011
 

Main Presentation
"Dirty Data? Clean It Up! By Tim Mitchell"

Dirty Data? Clean It Up!

Dirty data is everywhere, and it's headed for a database near you.  Extraction, transformation, and loading (ETL) can be difficult, but often the most challenging component of that process is the validation and clean up of data.  Information must be cleansed in such a way that it retains its original message and business value, while conforming to the expectations of the destination system(s).  In this session, we will discuss some of the high-level challenges of dealing with dirty data, along with some infrastructure-agnostic recommended practices for doing so.  Further, we will dig in and get our fingernails dirty (pun intended) on some specific ways that SQL Server Integration Services (SSIS) can be used to cleanse incoming data.

About the Presenter

Tim Mitchell is a business intelligence consultant, database developer, speaker, and trainer.  He has been working with SQL Server for over 7 years, working primarily in business intelligence, ETL/SSIS, database development, and reporting.  He has earned a number of industry certifications and holds a Bachelor's Degree in Computer Science from Texas A&M at Commerce, and is a Microsoft SQL Server MVP.  Tim is a business intelligence consultant for Artis Consulting in the Dallas, Texas area.  As an active member of the community, Tim has spoken at venues including numerous SQL Saturday events, Houston Tech Fest, and various user groups and PASS virtual chapters.  He is a board member and speaker at the North Texas SQL Server User Group in Dallas, is co-chair of the PASS BI Virtual Chapter, and is an active volunteer for PASS.  Tim is an author and forum contributor on SQLServerCentral.com and has published dozens of SQL Server training videos on SQLShare.com.  You can read his blog at TimMitchell.net or follow him on Twitter attwitter.com/Tim_Mitchell.

 



As usual, we will make time for question and answer sessions, networking, and eating. Prizes will be awarded at the conclusion of the meeting.

Our meeting location has changed!

Starting in January 2011, 
we will be meeting at the Microsoft headquarters!
 

7595 W. Technology Way, Suite 400
Denver, CO 80237
(720) 528-1700
map


Meeting Agenda:
5:30 - 6:00 - Food / Socializing
6:00 - 6:10 - User Group Business
6:10 - 6:30 - Open Panel/Questions & Answers
6:30 – 7:00 – 30 minute presentation
7:00 - 8:00 – Main presentation
8:10 - Give Away / Wrap-Up

Thursday, February 17, 2011 9:40:48 AM Categories: Community PASS SSIS

Kimball University: Microsoft Data Warehouse in Depth 

Several years ago, late 2006, I decided to focus my career on Data Warehousing and Business Intelligence (DW/BI). I was fortunate to have picked up a copy of a book titled “The Microsoft Data Warehouse Toolkit” written by members of the Kimball Group.  This is a great book and helped me immensely to get started with DW/BI using the Microsoft BI tool stack.  I would definitely recommend it to anyone thinking of getting into the DW/BI space especially those focused on the MS BI tool stack.

 

Since 2006, I have purchased every book that the Kimball Group has published.  The books contain great information  based off of the Kimball Group’s methodologies and experiences in building effective DW/BI solutions.  In addition to the many books I purchase yearly, I try and attend one paid in person training event each year. 

 

I feel that continuing ones education throughout your career is as important as when you are starting out.  Early of each year I plan and budget for an in person training event.  The last two years I’ve had the opportunity to attend the PASS Summit as a result of being President of the Denver SQL Server User Group.  This is a great event not only for the large amount of content but also for building ones professional network. 

 

While I was doing some research on different training events I found that the Kimball Group through the Kimball University would be holding a Microsoft Data Warehouse in Depth seminar here in Denver.  Having read, or at least perused all of their books, I thought this would be a great event and decided to attend it over the PASS Summit.

 

The Microsoft Data Warehouse in Depth seminar was held last week.  Here is a brief outline of the seminar.

The seminar’s outline mimics that of the book starting out with the DW/BI project life cycle, project planning and requirements gathering.  They continued into dimensional modeling, system architecture and setup.  Given that this has a Microsoft BI tool specific focus, SQL Server Integration Services (SSIS) within a DW environment is discussed.  They discussed setting up auditing, utilizing configuration files for package portability, error handling as well as loading methods for dimensions and fact tables.

 

The next focus of the seminar was on report development with SQL Server Reporting Services (SSRS)  and Excel.  Report identification and requirements lead this section with the intent of ensuring that the reports being developed are or will be useful to the business users as well as manageable from on IT perspective.

 

Following the dimensional modeling and ETL was discussion on SQL Server Analysis Services (SSAS). A walk through identifying the major areas of the development of an SSAS project was provided.  Key dimension topics such as attribute relationships were covered along with a demo of how they are properly created.  Cube development in BIDS (Business Intelligence Development Studio) was covered at a high level including calculated members, partitions, KPI’s, etc.  Data mining was also covered briefly.

 

The last part was on the management of the DW/BI environment dealing with security, Deployment and monitoring.  Both tools and techniques to perform the needed tasks were discussed and a few demoed.

 

Overall, the seminar was geared towards an intro level to provide a broad scope of an entire DW/BI project using the Microsoft BI tool stack.  I would recommend this seminar to anyone entering into the DW/Bi space or up to the Junior level (1/2 to 1 year of experience) in all of the three main pillars of the Microsoft BI Tool Stack (SSIS, SSAS, SSRS).  One thing that surprised me was the distances that many attendees traveled to attend.  There were several folks from the East Coast, South and even a couple from Canada.

Monday, September 20, 2010 7:32:00 AM Categories: DW/BI SSAS SSIS SSRS Training

Denver SQL Server UG September meeting 

For the past 1 1/2 years now we’ve been hosting two presentations each meeting.  The first presentation is typically a 30 minute intro level topic and the second a 60 minute more advanced topic.  This month we are giving something else a go.  We are hosting two 60 minute presentations.  The first topic is “Turbocharge SQL Server Insert Performance with the Hidden SqlCommandSet Class” being presented by Brooke Philpott of SQL Sentry.  The second presentation is “Efficient Operational Data Replication (ODR) with the Attunity CDC Suite for SSIS” by Doug Wheaton of AttunityAttunity is also the featured sponsor for this month.

 

Here is more detail:

SQLSentry02-72

The first presentation, "Turbocharge SQL Server Insert Performance with the Hidden SqlCommandSet Class" will be given by Brooke Philpott of SQL Sentry.


SQL Server provides multiple ways to insert data using ADO.NET. We’ll explore individual writes as well as batched writes in non-RPC and RPC form, with a focus on using the internal SqlCommandSet class to get as close as possible to bulk-copy performance, without the restrictions. Along the way we’ll look at the impact of the various methods on the SQL Server plan cache, and address ways to prevent plan cache bloat when using high-volume inserts, ultimately arriving at a method that is very fast and has minimal impact on the plan cache.


Brooke Philpott is a Senior Technical Lead at SQL Sentry, Inc. Brooke is the lead developer of SQL Sentry’s award-winning software products. He was also part of the DBA team managing the largest SQL Server hosting shop in the Southeastern United States. Brook has BS in Mathematics from Davidson College. His hobbies include cooking, photography, music, and video games. You can read his blog here: http://brooke.blogs.sqlsentry.net/

 

attunitylogo

The second presentation, "Efficient Operational Data Replication (ODR) With the Attunity CDC Suite for SSIS" will be given by Doug Wheaton of Attunity.


During this presentation you will learn how to leverage your SQL Server skills and software to reduce cost and accelerate operational data replication (ODR), synchronization, and real-time integration of heterogeneous data sources including SQL Server, Oracle, and DB2. Doug will explain CDC (change data capture) and ODR concepts and benefits as well as how you can use them to enable operational reporting and business intelligence or to maintain data consistency across systems.


Doug Wheaton is a Senior Software Engineer at Attunity and has over 20 years of experience in the content, workflow management and data integration space. He has written numerous related articles and has presented at many Microsoft conferences throughout the US on the topic of data integration using Microsoft SSIS.

 


Location:

AmeriTeachlogo
AmeriTeach
7800 East Dorado Place, Suite 150
Greenwood Village, CO 80111
720-346-1710


Meeting Agenda:
5:30 - 6:00 - Food / Socializing
6:00 - 6:10 - User Group Business
6:10 - 6:15 - Open Panel/Questions & Answers
6:15 - 7:15 - Presentation 1
7:15 - 8:15 - Presentation 2
8:15 - 8:20 - Give Away / Wrap-Up

 

And of course: SQL Saturday #52 – Colorado is a training event for SQL Server professionals and those wanting to learn about SQL Server. Admittance to this event is free with the exception of a $10 lunch fee; all other costs are covered by donations and sponsorships. Please also let friends and colleagues know about the event.
Registration for the 9/25 event is now open at: http://www.sqlsaturday.com/52/register.aspx

Thursday, September 16, 2010 3:32:00 AM Categories: Community DenverSQL SSIS