Google Cloud Bigtable: Storing multiple rows for a row key - google-cloud-platform

In Bigtable, I am trying to create a column family corresponding to a row key in the format shown below.
Under the preferences column, there are multiple cells. Note that these are not multiple versions of the same cell, but multiple cells in a column corresponding to the same row key.
Access patterns include:
reading all the preferences of a user (RK)
reading the beta preference of a user
and so on.
How do I create a column family in this schema?

The most straightforward option is to create a column family called "preferences" with columns named "alpha", "beta", "gamma", etc. This structure supports both access patterns: to read all preferences, read every column in the family; to read a single preference, apply a column qualifier filter.
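To make the layout concrete, here is a minimal sketch that models one row as nested dictionaries (row key, then column family, then column qualifier). It is a toy in-memory model, not the Bigtable client API; the row key "user#123", the preference values, and both helper functions are hypothetical names chosen for illustration.

```python
# Toy in-memory model of a Bigtable row: row key -> family -> qualifier -> value.
# This is NOT the Bigtable client; the key and values below are made up.
table = {
    "user#123": {
        "preferences": {
            "alpha": "on",
            "beta": "off",
            "gamma": "dark",
        }
    }
}


def read_all_preferences(table, row_key):
    """Return every column in the 'preferences' family for one row."""
    return dict(table.get(row_key, {}).get("preferences", {}))


def read_preference(table, row_key, qualifier):
    """Return one column, analogous to applying a column qualifier filter."""
    return table.get(row_key, {}).get("preferences", {}).get(qualifier)


all_prefs = read_all_preferences(table, "user#123")
beta = read_preference(table, "user#123", "beta")
```

Both access patterns hit a single row key, so neither one needs a scan across rows.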

Related

How do I create a Measure that will combine data from unrelated tables to be used as a Value in a chart?

I have multiple fact tables due to the way my company splits document types. I have created dummy data simulating these documents. They are "Survey", "Complaints", and "Returns". Survey and Return have their own document types, "F4" and "F5" respectively, which are serialized by a 12-digit number and then broken down by a defect code in another column. "Survey" then combines those serial numbers into one table, but does not have the defect code. I would like to either find a way to combine these tables for visualization or find a DAX formula that will combine the columns from each table and display the count of each document type. So the visualization I would like will be:
A slicer on Customer Key, then a bar chart that shows the count of each document type ("F4" and "F5"), and then a column chart that shows the defect code count for each document type. I have attached a link to the OneDrive with some dummy data. Thanks for any help. Dummy Data Link

Manipulating .xls columns and rows with Open Refine

I need to manipulate a data set such that it can be mapped with Google Fusion Tables. Current xls data is formatted as follows:
Image of xls file with personal data anonymized
Note that a blank row indicates a new entry. I need the information in the column to be sorted into rows under the appropriate heading, specifically the address for geocoding. Any ideas?
First, do some cleanup to merge your second and third columns into a single one, and then use the feature Columnize by key/value columns to transpose the data in the third and fourth columns into separate fields.
Once this is done, Fusion Tables should be able to geocode the dataset based on the address. If it cannot, there are plenty of tutorials on geocoding a dataset with OpenRefine. See:
OpenRefine wiki,
Google Maps,
OpenStreetMap,
Yahoo Maps.
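The columnize step in the answer above can be illustrated outside OpenRefine. The sketch below approximates its effect in plain Python and is not OpenRefine's implementation: it assumes each non-blank row is a (key, value) pair and that a blank row separates records, as described in the question. The function name and the sample data are hypothetical.

```python
def columnize(rows):
    """Turn (key, value) rows into one dict per record.

    A row whose key and value are both empty closes the current record,
    matching the blank-row separator described in the question.
    """
    records, current = [], {}
    for key, value in rows:
        if not key and not value:
            if current:
                records.append(current)
                current = {}
            continue
        current[key] = value
    if current:
        records.append(current)
    return records


# Made-up sample data: two entries separated by a blank row.
rows = [
    ("Name", "Record A"),
    ("Address", "1 Example St"),
    ("", ""),
    ("Name", "Record B"),
    ("Address", "2 Sample Ave"),
]
records = columnize(rows)
```

Each resulting record has its own "Address" field, which is the shape a geocoder needs.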

Comparing 2 Tables in PowerBI

Working on a way to compare 2 tables in PowerBI.
I'm joining the 2 tables using the primary key and making custom columns that compare if the old and new are equal.
This doesn't seem like the most efficient way of doing things, and I can't even color code the matrix because some values aren't integers.
Any suggestions?
I did a big project like this last year, comparing two versions of a data warehouse (SQL database).
I tackled most of it in the Query Editor (actually using Power Query for Excel, but that's the same as PBI's Query Editor).
My key technique was to first create a Query for each table, and use Unpivot Other Columns on everything apart from the Primary Key columns. This transforms it into rows of Attribute, Value. You can filter Attribute to just the columns you want to compare.
Then in a new Query you can Merge & Expand the "old" and "new" Queries, joining on the Primary Key columns + the Attribute column. Then add Filter or Add Column steps to get to your final output.
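The same unpivot-then-merge idea can be sketched in plain Python to show the logic outside the Query Editor. This is an illustration under assumptions, not Power Query code: the function names, the "id" key column, and the sample rows are all hypothetical, and the merge behaves like a full outer join on primary key plus attribute.

```python
def unpivot(rows, key_cols):
    """Map (primary key values..., attribute) -> value for each non-key column."""
    out = {}
    for row in rows:
        key = tuple(row[c] for c in key_cols)
        for attr, value in row.items():
            if attr not in key_cols:
                out[key + (attr,)] = value
    return out


def compare(old_rows, new_rows, key_cols):
    """Join old and new on key + attribute and keep only the differences."""
    old = unpivot(old_rows, key_cols)
    new = unpivot(new_rows, key_cols)
    diffs = []
    for k in sorted(old.keys() | new.keys()):
        old_value, new_value = old.get(k), new.get(k)
        if old_value != new_value:
            # (primary key, attribute, old value, new value)
            diffs.append((k[:-1], k[-1], old_value, new_value))
    return diffs


# Made-up sample tables keyed on "id".
old_rows = [
    {"id": 1, "name": "Ann", "city": "Oslo"},
    {"id": 2, "name": "Bob", "city": "Rome"},
]
new_rows = [
    {"id": 1, "name": "Ann", "city": "Bergen"},
    {"id": 2, "name": "Bob", "city": "Rome"},
    {"id": 3, "name": "Cy", "city": "Lima"},
]
diffs = compare(old_rows, new_rows, ["id"])
```

Because every value ends up in one Attribute/Value pair, mixed data types across columns no longer matter for the comparison. Note that in this sketch a value missing on one side shows up as None, so it cannot be told apart from a stored None.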

Google charts table random display

I have multiple tables in my app. I cannot control the number of columns or the string lengths in each column, nor can I control the order of the columns.
Right now I have 4 tables in one set and one table in another set. I will show only the headers in this image, as the other data is confidential.
The problem I encounter is this: the string length in a column of one table is not the same as in the same column of another table, so the column widths are not the same within a table set. I have tried to check the column lengths on page load, but the tables are not yet drawn at that point. By a table set I mean a list of tables with the same properties and the same column count and names.
I cannot set a fixed size on a certain column because I cannot know how the columns will be distributed in the next table set.
How can I make all the columns look alike? How can I catch an event when the table is fully drawn?

SAS Enterprise Miner split Dataset by binary variable

I am using SAS Enterprise Miner 13.2.
I have a SAS table as a data source. In this table I have a binary variable D_TYP ("I" and "P") and other categorical variables.
I want to split the data by D_TYP so that I get two tables: one with all "I" and the other with all "P". The problem is that I don't know how.
I have looked in the taskbar and tried Filter and Data Partition. I can probably use SAS Code to split the data, but I think there is another way with the tasks.
You could use two filter nodes to do the job, with one filtering out I and the other filtering out P. Each resulting data set should then contain only one value of the binary variable. In case you are not familiar with the filter node, click on the option Class Variable in the properties panel and apply a User specified filter. You have to manually select the group by clicking on its corresponding bar.