Can you use a SAS dataset as a SSIS data source? - sas

We have a some ETL processes that read CSV files that are output from SAS programs. I'm in the process of upgrading one of these ETLs and was wondering if I could use SSIS to read directly from the SAS dataset.
Has anybody done this successfully?

See here
"You can use SAS Local Data Provider (can be downloaded separately and comes as part of SAS for Windows installation). "

Recently I've moved data from SAS environment:
In connection manager I choose _Native OLE DB\SAS Local Data Provider 9.3_
Enter file name (`\..\dev` before the table name). Click OK
Drag _OLE DB Source_ into the Data Flow
Right click on _OLE DB Source_ and choose _Show advanced editor_
In the first tab (_Connection Manager_) choose _SAS Connection Manager_ from drop down list you created now
In the Component Properties tab in _OpenRowset_ write the name of the table, click OK
If you have `datetime` type transform it using Derived Transformation Editor

For loading SAS XPT or SAS7BDAT data files without having an instance of SAS to connect to via OBDC we used the following
A third party tool (STATTransfer) to read the XPT file
STATTransfer ODBC driver
Setup the connection in SSIS as an ODBC datasource and load into the database for processing.
There are SAS datasource SSIS extension available http://www.cozyroc.com offer a SAS Data connection, but they where outside our price range

Related

Adding static Excel to automatically refreshing Power BI report

I have an existing PowerBI report that imports data from an SQL Server analytics services database. This is working fine and I can schedule automatic refreshes using the Gateway provided by my organization.
I would now like to add some additional, but rarely changing data, that I only have in a local Excel file. When I do add this data, the report stops refreshing automatically and complains, that it has no gateway to refresh this Excel file.
What I would like is that Power BI is refreshing the data of the SQL Server analytics services database, but just keeps the existing Excel file without updating it. - I will upload an updated version of the PowerBI report if I need to change the data in the Excel file.
Is that possible? I couldn't find out how. I was trying to upload the Excel file to a different dataset to the Power BI service and reference this dataset in my report. Just to find out, that I cannot access a different Power BI dataset and SQL server analysis services database from the same report.
Three things I can think of
Upload the file to onedrive/sharepoint so that it's accessible online (per Dev's answer)
If the data is simple enough, you can add the data directly into PowerBI itself and skip the Excel file entirely.
You can disable the Excel file refresh so that PBI does not try and refresh(and thus access) the local Excel file. (Not sure if this will work)
I had a similar issue I came across. Yes, you can just use Enter Data to add a table, but you can only build something with less than 3000 cells, so you'd have to merge several tables if something was larger than that.
Turning off the report refresh in the suggestion above (#3) still requires a gateway, unfortunately.
I just created a dataflow and plopped the data from my csv there. You'll have to create a connection and refresh it, but you don't need to schedule a refresh there, so no need to create a gateway.
Then just link the dataflow as a source to your .pbix file and setup your gateway to point at the dataflow.

Framework selection for a new project?

Problem Context
We have a set of excel reports which are generated from an excel input provided by the user and then fed into SAS for further transformation. SAS pulls data from Teradata database and then there is a lot of manipulation that happens with the input data & data pulled from Teradata. Finally, a dataset is generated which can either be sent to the client as a report, or be used for populating Tableau dashboard. Also the database is being migrated from Teradata to Google Cloud (Big Query EDW) as the Teradata pulls from SAS used to take almost 6-7 hours
Problem Statement
Now we need to automate this whole process, by creating front end for the user to upload the input files and from there on the process should trigger and in the end the user should receive the excel file or Tableau dashboard as an attachment in a mail.
Can you suggest what technologies should be used in the front end & middle tier to make this process feasible is least possible time with google cloud platform as the backend?
Can an R shiny front end be a solution given that we need to communicate with a Google Cloud backend ?
I have got suggestion from people that Django will be a good framework to accomplish this task. What are your views on this ?

SAS EG - Open a table without creating table link in process flow

This seems rather trivial, but in SAS Enterprise Guide, is there a way to simply open a SAS dataset from a SAS library without having a shortcut added automatically to the process flow?
For SAS EG 5.1, you need to do the following:
Right click on data set
Select Explore
Go to "Data Explorer"
Double click on data set that you want to explore
Once it is in the Data Explorer, you don't need to do steps 1-3 again for that dataset.
In EG 7 (I might be able to find and older version if you don't have 7):
Tools -> Options -> Data/Data General
Under "Default Action for Data in Server List and SAS Folders", check "Explore".
From the documentation included, that "opens data sets in the Date Explorer when you double-click then in the Server List and SAS Folders windows. The Data Explorer enables you to view a SAS Data Set without adding the data set to your project."

Connecting to SAS dataset using OLE DB in QlikView

I'm trying to use an OLEDB connection to connect to my SAS datasets from QlikView.
I am able to set up the connection, but I am not able to load my data. Executing the data import results in the following error: "the local provider does not currently support sql processing"
My loading code looks like this:
LOAD "account",
balance,
"open_closed";
SQL SELECT *
FROM "BASE_DATA";
Is there any way to solve this?
We read SAS data into QlikView using SAS ODBC driver that connects to SAS/SHARE.
This requires that you have one of the following three:
1. SAS/SPD Server
2. SAS/SHARE Server
3. SAS locally on desktop (will not work on server license)

Anyone using a web service as a data source in Excel 2007?

Can I use a web service as a data source for creating Excel pivot tables?
Currently, the soure data for the pivot table is being exported from our SQL db to a CSV file. Then, the CSV file is loaded into a worksheet. From there, a pivot table is created in the same workbook.
Customers login to a website, click some links, and an excel file (with data and pivot table) is generated. This is a public app so the preference is to not connect directly to the DB.
We control the database and generate the output. We are looking to streamline this process. The SQL db and pivot tables can not / will not change.
See http://www.vertex42.com/News/excel-web-query.html
What format does the "public-facing website" use in making the data available? A data file, a table on a web page? This issue will determine how much of a scraping operation you'll need to do.
You'll still need to write the web service and have it run on a server. A possible alternative is to use Yahoo Pipes to do the conversions for you.