I am designing an application which will involve bulk upload of records to a Postgres DB (Lets call the schema DB-1). The uploads will be done almost every week. Size could range from a few million to a billion records. The data that is going to be u
I am designing a data mart for University students and confused about visa and passport information that should it go in the student dimension or should I define a separate dimension for it. Which would be the better approach?
I'm new on dimensional modelling I believe that you guys can help me in the following doubts. In the production system I have a transaction table, sales table for example.The unique identifier is a primary key called SaleId. Example: My doubt is when
I'm building a Data warehouse for my work and wanted to know if you knew of good resources and examples on actually building them in a Oracle enviroment. I have Ralph Kimball's book 'The data Warehouse toolkit' allready and I did take a class in my d
How can we load fact table in star schema using informatica powercenter ? Can you please provide any example for mappings/tranformations for this. --------------Solutions------------- Take the Staging tables as source tables and take the dimensions a
I need to create a dimensional environment for sales analysis for a retail company. The hierarchy that will be present in my Sales fact is: 1 - Country 1.1 - Region 1.1.1 - State 188.8.131.52 - City 184.108.40.206.1 - Neighbourhood 220.127.116.11.1.1 - Store 18.104.22.168.1.
I have a SQL Server 2012 table that will contain 2.5 million rows at any one time. Items are always being written into the table, but the oldest rows in the table get truncated at the end of each day during a maintenance window. I have .NET-based rep
I'm creating a data warehouse for a healthcare company. They have separate databases for different hospitals which contain tables on patients,their insurance,etc and PK is unique only within one hospital DB. When merged, I'm supposed to create a Mast
Can Dimension Table became a fact table as well? For instance, I have a Customer dimension table with standard attributes such as name, gender, etc. I need to know how many customers were created today, last month, last year etc. using SSAS. I could
I have to create a mixture of MDX and TSQL as follows: select "[State].[Country].[Country].[MEMBER_CAPTION]" as State, "[Measures].[someMeasure]" as [Sum] from openquery(my_olap_server, 'select [Measures].[someMeasure] on 0, filter([State].[Country].
Background I am designing a Data Warehouse with SQL Server 2012 and SSIS. The source system handles hotel reservations. The reservations are split between two tables, header and header line item. The Fact table would be at the line item level with so
I am a beginner to DataWarehousing. We have created a data mart, a star schema design to load quarterly data. We have been loading the current data as and when approved by the business for that quarter. Now we have a requirement to go back and load h
I am using SAP BW, I have to write a transformation from 0FI_GL_4 datasource, I want to find out the accounting document whose line item is in two G/L account? for example, if the accounting document '123456' , have 4 line items, and 1 line item's G/
I want to set up a fact table for restaurant sales transactions. Adding up the entire fact table will give the entire sales across the restaurant(s). The restaurant has two main sources of revenue - food and beverage. The dimensions for each are very
My understanding says that the dimensions should be extracted first, then the facts should be extracted. That way, the foreign keys will still be honoured in staging area. While loading, the same sequencing should be used, for the same obvious reason
I am trying to convert a Vertica view definition to Teradata. I encountered a where clause in Vertica which goes like Where ( ColumnA or Column B); I am not sure how this works as there is no comparison. Any Ideas ?? --------------Solutions----------
I am learning stuff so that i can enter data warehousing field. I was reading the book on DW and it says knowledge of spreadsheets will be good for DW. I have some time left before applying for jobs. Should i start learning microsoft excel in advance
I want to work in data warehousing and data analyst jobs. I am reading books on data mining and warehousing . But i am going mad by the techincal math stuff like probability , fourier transform and wavelet function. I am not very good at those statis
If there is an Identity column in Vertica that has no parameters defined, how does it work ? CREATE MULTISET TABLE db.user_state ( active_user_state_key IDENTITY , load_key int NOT NULL ) For example in above code, where will the Identity column star
I am designing a movie rental data warehouse I want the fact table to consist of movie rentals/returns but I'm getting confused. The movies can be returned at any store so I need to show that. I have these dimensions: time, customerinfo, movie info ,