SQL Troubles: permissions

Showing posts with label permissions. Show all posts

13 March 2025

🏭🗒️Microsoft Fabric: Workspaces [Notes]

Disclaimer: This is work in progress intended to consolidate information from various sources for learning purposes. For the latest information please consult the documentation (see the links below)!

Last updated: 25-Mar-2025

[Microsoft Fabric] Workspace

{def} a collection of items that brings together different functionality in a single environment designed for collaboration
{default} created in organization's shared capacity

workspaces can be assigned to other capacities

includes My Workspaces
via Workspace settings >> Premium

shared environment that holds live items

any changes made directly within the workspace immediately take effect and impact all users

components

header

contains

name
brief description of the workspace
links to other functionality

toolbar

contains

controls for managing items to the workspace
controls for managing files

view area

enables selecting a view
{type} list view

{subitem} task flow

area in which users can create or view a graphical representation of the data project [3]

⇐ shows the logical flow of the project [3]

⇐ it doesn't show the flow of data [3]

can be hided via Show/Hide arrows

{subitem} items list

area in which the users can see the items and folders in the workspace [3]
one can filter the items list by selecting the tasks, if any defined [3]

{subitem} resize bar

elements that allow to resize the task flow and items list by dragging the resize bar up or down [3]

{type} lineage view

shows the flow of data between the items in the workspace [3]

{feature} workspace settings

allows to manage and update the workspace [3]

{feature} contact list

allows to specify which users receive notification about issues occurring in the workspace [3]
{default} contains workspace's creator [3]

{feature} SharePoint integration

allows to configure a M365 Group whose SharePoint document library is available to workspace users [3]

⇐ the group is created outside of MF first [3]
restrictions may apply to the environment

{best practice} give access to the workspace to the same M365 Group whose file storage is configured [3]

MF doesn't synchronize permissions between users or groups with workspace access, and users or groups with M365 Group membership [3]

{feature} workspace identity

an automatically managed service principal that can be associated with a Fabric workspace [6]

workspaces with a workspace identity can securely read or write to firewall-enabled ADSL Gen2 accounts through trusted workspace access for OneLake shortcuts [6]
Fabric creates a service principal in Microsoft Entra ID to represent the identity [6]

⇐ an accompanying app registration is also created [6]
Fabric automatically manages the credentials associated with workspace identities [6]

⇒ prevents credential leaks and downtime due to improper credential handling [6]

used to obtain Microsoft Entra tokens without the customer having to manage any credentials [6]

Fabric items can use the identity when connecting to resources that support Microsoft Entra authentication [6]

can be created in the workspace settings of any workspace except My workspaces
automatically assigned to the workspace contributor role and has access to workspace items [6]

{feature} workspace roles

allows to manage who can do what in a workspace [4]
sit on top of OneLake and divide the data lake into separate containers that can be secured independently [4]
extend the Power BI workspace roles by associating new MF capabilities

e.g. data integration, data exploration

can be assigned to

individual users
security groups
Microsoft 365 groups
distribution lists

{role} Admin
{role} Member
{role} Contributor
{role} Viewer
user groups

members get the role(s) assigned
users existing in several group get the highest level of permission that's provided by the roles that they're assigned [4]
{concept} [nested group]

{concept} current workspace

the active open workspace

{action} create new workspace
{action} pin workspace
{action} delete workspace

everything contained within the workspace is deleted for all group members [3]

the associated app is also removed from AppSource [3]

{warning} if the workspace has a workspace identity, that workspace identity will be irretrievably lost [3]

this may cause Fabric items relying on the workspace identity for trusted workspace access or authentication to break [3]

only admins can perform the operation

{action} manage workspace
{action} take ownership of Fabric items

Fabric items may stop working correctly [5]

{scenario} the owner leaves the organization [5]
{scenario}the owner don't sign in for more than 90 days [5]
in such cases, anyone with read and write permissions on an item can take ownership of the item [5]

become the owner of any child items the item might have
{limitation} one can't take over ownership of child items directly [5]

⇐ one can take ownership only through the parent item [5]

{limitation} workspace retention

personal workspace

{default} 30 days [8]

collaborative workspace

{default} 7 days [8]
configurable between 7 to 90 days [8]

via Define workspace retention period setting [8]

during the retention period, Fabric administrators can restore, respectively delete permanently the workspace [8]

after that, the workspace is deleted permanently and it and its contents are irretrievably lost [8]

{limitation} can contain a maximum of 1000 items [8]

includes both parent and child items [8]

refers to Fabric and Power BI items [8]

workspaces with more items have 180-day extension period applied automatically as of 10-Apr-2025 [8]

{limitation} certain special characters aren't supported in workspace names when using an XMLA endpoint [3]
{limitation} a user or a service principal can be a member of up to 1000 workspaces [3]
{feature} auditing

several activities are audited for workspaces [3]

CreateFolder
DeleteFolder
UpdateFolder
UpdateFolderAccess

{feature} workspace monitoring

Eventhouse secure read-only database that collects and organizes logs and metrics from a range of Fabric items in the workspace [1]

accessible only to workspace users with at least a contributor role [1]
users can access and analyze logs and metrics [1]
the data is aggregated or detailed [1]
can be queried via KQL or SQL [1]
supports both historical log analysis and real-time data streaming [1]
accessible from the workspace [1]

one can build and save query sets and dashboards to simplify data exploration [1]

use the workspace settings to delete the database [1]

wait about 15 minutes before recreating a deleted database [1]

{action} share the database

users need workspace member or admin role [1]

{limitation} one can enable either

workspace monitoring
log analytics

if enabled, the log analytics configuration must be deleted first before enabling workspace monitoring [1]

one should wait for a few hours before enabling workspace monitoring [1]

{limitation} retention period for monitoring data: 30 days [1]
{limitation}the ingestion can't be configured to filter for specific log type or category [1]

e.g. error or workload type.

{limitation} user data operation logs aren't available even though the table is available in the monitoring database [1]
{prerequisite} Power BI Premium or Fabric capacity [1]
{prerequisite} workspace admins can turn on monitoring for their workspaces tenant setting is enabled [1]

enabling the setting, requires Fabric administrator rights [1]

{prerequisite} admin role in the workspace [1]

workspace permissions

the first security boundary for data within OneLake [7]

each workspace represents a single domain or project area where teams can collaborate on data [7]
security is managed through Fabric workspace roles [7]

items can have permissions configured separately from the workspace roles [7]

permissions can be configured either by [7]

sharing an item
managing the permissions of an item

Previous Post <<||>> Next Post

References:
[1] Microsoft Learn (2024) Fabric: What is workspace monitoring (preview)? [link]

[2] Microsoft Fabric Update Blog (2024) Announcing preview of Workspace Monitoring? [link]
[3] Microsoft Learn (2024) Fabric: Workspaces in Microsoft Fabric and Power BI [link]
[4] Microsoft Learn (2024) Fabric: Roles in workspaces in Microsoft Fabric [link]
[5] Microsoft Learn (2024) Fabric: Take ownership of Fabric items [link]
[6] Microsoft Learn (2024) Fabric: Workspace identity [link]
[7] Microsoft Learn (2024) Fabric: Role-based access control (RBAC) [link]
[8] Microsoft Learn (2024) Fabric: Manage workspaces [link]

Resources:

[R1] Microsoft Learn (2025) Fabric: What's new in Microsoft Fabric? [link]

Acronyms:
ADSL Gen2 - Azure Data Lake Storage Gen2
KQL - Kusto Query Language
M365 - Microsoft 365
MF - Microsoft Fabric

SQL - Structured Query Language

11 March 2025

🏭🎗️🗒️Microsoft Fabric: Real-Time Dashboards (RTD) [Notes]

Disclaimer: This is work in progress intended to consolidate information from various sources for learning purposes. For the latest information please consult the documentation (see the links below)!

Last updated: 10-Mar-2025

Real-Time Intelligence architecture [5]

[Microsoft Fabric] Real-Time Dashboard

[def]

a collection of tiles

optionally organized in pages

act as containers of tiles
organize tiles into logical groups

e.g. by data source or by subject area

used to create a dashboard with multiple views

e.g. dashboard with a drillthrough from a summary page to a details page [1]

each tile has

an underlying query
a visual representation

exists within the context of a workspace [1]

always associated with the workspace used to create it [1]

{concept} tile

uses KQL snippets to retrieve data and render visuals [1]
can be added directly from queries written in a KQL queryset [1]

{concept} data source

reusable reference to a specific database in the same workspace as the dashboard [1]

{concept} parameters

significantly improve dashboard rendering performance [1]
enable to use filter values as early as possible in the query

filtering is enabled when the parameter is included in the query associated with the tiles [1]

{concept} cloud connection

uses dashboard owner's identity to give access to the underlying data source to other users [2]
when not used for 90 days, it will expire [2]

⇒ a new gateway connection must be set up [2]
via Manage connections >> Gateways page >> Edit credentials and verify the user again

a separate connection is needed for each data source [2]

{feature} natively export KQL queries to a dashboard as visuals and later modify their underlying queries and visual formatting as needed [1]

the fully integrated dashboard experience provides improved query and visualization performance [1]

{feature} encrypted at rest

dashboards and dashboard-related metadata about users are encrypted at rest using Microsoft-managed keys [1]

{feature} auto refresh

allows to automatically update the data on a dashboard without manually reloading the page or clicking a refresh button [1]
can be set by a database editor

both editors and viewers can change the actual rate of auto refresh while viewing a dashboard [1]

database editors can limit the minimum refresh rate that any viewer can set

⇐ reduces the cluster load
when set, database users can't set a refresh rate lower than the minimum [1]

{feature} explore data

enables users to extend the exploration of dashboards beyond the data displayed in the tiles [3]

begins with viewing the data and its corresponding visualization as they appear on the tile [3]
users can add or removing filters and aggregations, and use further visualizations [3]

⇐ no knowledge of KQL is needed [3]

{feature} conditional formatting

allows users to format data points based on their values, utilizing

colors

{rule} color by condition

allows to set one or more logical conditions that must be met for a value to be colored [4]
available for table, stat, and multi stat visuals [4]

{rule} color by value

allows to visualize values on a color gradient [4]
available for table visuals [4]

tags
icons

can be applied either

to a specific set of cells within a designated column [4]
to entire rows [4]

one or more conditional formatting rules can be applied for each visual [4]

when multiple rules conflict, the last rule defined takes precedence over any previous ones [4]

{action} export dashboard

dashboards can be exported to a JSON file
can be useful in several scenarios

{scenario} version control

the file can be used to restore the dashboard to a previous version [1]

{scenario} dashboard template

the file can be used as template for creating new dashboards [1]

{scenario} manual editing
edit the file to modify the dashboard and imported the file back to the dashboard [1]

ADX dashboards can be exported and imported as RT dashboards [6]

{action} share dashboard

one can specify if the user can view, edit, or share [2]
⇐ the permissions are not for the underlying data [2]

permissions are set by defining the identity that the dashboard uses for accessing data from each data sources[2]
{type|default} pass-through identity

used when authenticating to access the underlying data source [2]
the user is only able to view the data in the tiles [2]

{type} dashboard editor’s identity:

allows the user to use editor’s identity, and thus permissions[2]

the editor defines a cloud connection that the dashboard uses to connect to the relevant data source [2]
only editors can define cloud connections and permissions for a specific real-time dashboard [2]

each editor that modifies the real-time dashboard needs to set up own cloud connection [2]
if a valid connection doesn't exist, the user is able to view the real-time dashboard but will only see data if they themselves have access to it [2]

{action} revoke a user’s access permissions

remove access from the dashboard [2]
remove the cloud connection.

via Settings >> Manage connections and gateways >> Options >> Remove

remove the user from the cloud connection.

via Settings >> Manage connections and gateways >> Options >> Manage users >> {select User} >> Delete

edit the Data source access permissions.

via Data source >> New data source >> edit >> Data source access >> Pass-through identity
⇐ the user uses own identity to access the data source [2]

{prerequisite} a workspace with a Microsoft Fabric-enabled capacity [1]
{prerequisite} a KQL database with data [1]
{setting} Users can create real-time dashboards [1]

Previous Post <<||>> Next Post

References:

[1] Microsoft Learn (2024) Fabric: Create a Real-Time Dashboard [link]

[2] Microsoft Learn (2024) Fabric: Real-Time Dashboard permissions (preview) [link]

[3] Microsoft Learn (2024) Fabric: Explore data in Real-Time Dashboard tiles [link]
[4] Microsoft Learn (2024) Fabric: Apply conditional formatting in Real-Time Dashboard visuals [link]

[5] Microsoft Learn (2025) Fabric: Real Time Intelligence L200 Pitch Deck [link]
[6] Microsoft Fabric Updates Blog (2024) Easily recreate your ADX dashboards as Real-Time Dashboards in Fabric, by Michal Bar [link]
[7] Microsoft Learn (2025) Create Real-Time Dashboards with Microsoft Fabric [link]

Resources:

[R1] Microsoft Learn (2024) Microsoft Fabric exercises [link]
[R2] Microsoft Fabric Updates Blog (2024) Announcing Real-Time Dashboards generally available [link]

[R3] Microsoft Learn (2025) Fabric: What's new in Microsoft Fabric? [link]

Acronyms:
ADX - Azure Data Explorer
KQL - Kusto Query Language
MF - Microsoft Fabric
RT - Real-Time

22 January 2025

🏭🗒️Microsoft Fabric: Clone Tables in Warehouses [Notes]

Disclaimer: This is work in progress intended to consolidate information from various sources for learning purposes. For the latest information please consult the documentation (see the links below)!

Last updated: 22-Jan-2025

[Microsoft Fabric] Zero-copy Clone

{def} a replica of an existing OneLake table created by copying existing table's metadata and referencing its data files [1]

the metadata is copied while the underlying data of the table stored as parquet files is not copied [1]
its creation is like creating a delta table [1]
DML/DDL changes on the source

are not reflected in the clone table [1]
are not reflected on the source [1]

can be created within or across schemas in a warehouse [1]
created based on either:

current point-in-time

based on the present state of the table [1]

previous point-in-time

based on a point-in-time up to seven days in the past

the table clone contains the data as it appeared at a desired past point in time
all CRUD operations are retained for seven calendar days

created with a timestamp based on UTC

{characteristic} autonomous existence

the original source and the clones can be deleted without any constraints [1]
once a clone is created, it remains in existence until deleted by the user [1]

{characteristic} inherits

object-level SQL security from the source table of the clone [1]

DENY permission can be set on the table clone if desired [1]

the workspace roles provide read access by default [1]

all attributes that exist at the source table, whether the clone was created within the same schema or across different schemas in a warehouse [1]
the primary and unique key constraints defined in the source table [1]

a read-only delta log is created for every table clone that is created within the Warehouse [1]
{benefit} facilitates development and testing processes

by creating copies of tables in lower environments [1]

{benefit} provides consistent reporting and zero-copy duplication of data for analytical workloads and ML modeling and testing [1]
{benefit} provides the capability of data recovery in the event of a failed release or data corruption by retaining the previous state of data [1]
{benefit} helps create historical reports that reflect the state of data as it existed as of a specific point-in-time in the past [1]
{limitation} table clones across warehouses in a workspace are not currently supported [1]
{limitation} table clones across workspaces are not currently supported [1]
{limitation} clone table is not supported on the SQL analytics endpoint of the Lakehouse [1]
{limitation} clone of a warehouse or schema is currently not supported [1]
{limitation} table clones submitted before the retention period of seven days cannot be created [1]
{limitation} cloned tables do not currently inherit row-level security or dynamic data masking [1]
{limitation} changes to the table schema prevent a clone from being created prior to the table schema change [1]
{best practice} create the clone tables in dedicated schema(s)
[syntax] CREATE TABLE <schema.clone_table_name> AS CLONE OF <schema.table_name>

Previous Post <<||>> Next Post

References:
[1] Microsoft Learn (2023) Clone table in Microsoft Fabric [link]
[2] Microsoft Learn (2024) Tutorial: Clone tables in the Fabric portal [link]
[3] Microsoft Learn (2024) Tutorial: Clone a table with T-SQL in a Warehouse [link]
[4] Microsoft Learn (2024) SQL: CREATE TABLE AS CLONE OF [link]

Resources:

[R1] Microsoft Learn (2025) Fabric: What's new in Microsoft Fabric? [link]

🏭🗒️Microsoft Fabric: Folders [Notes]

Disclaimer: This is work in progress intended to consolidate information from various sources for learning purposes. For the latest information please consult the documentation (see the links below)!

Last updated: 22-Jan-2025

[Microsoft Fabric] Folders

{def} organizational units inside a workspace that enable users to efficiently organize and manage artifacts in the workspace [1]
identifiable by its name

{constraint} must be unique in a folder or at the root level of the workspace
{constraint} can’t include certain special characters [1]

C0 and C1 control codes [1]
leading or trailing spaces [1]
characters: ~"#.&*:<>?/{|} [1]

{constraint} can’t have system-reserved names

e.g. $recycle.bin, recycled, recycler.

{constraint} its length can't exceed 255 characters

{operation} create folder

can be created in

an existing folder (aka nested subfolder) [1]

{restriction} a maximum of 10 levels of nested subfolders can be created [1]
up to 10 folders can be created in the root folder [1]
{benefit} provide a hierarchical structure for organizing and managing items [1]

the root

{operation} move folder
{operation} rename folder

same rules applies as for folders’ creation [1]

{operation} delete folder

{restriction} currently can be deleted only empty folders [1]

{recommendation} make sure the folder is empty [1]

{operation} create item in folder

{restriction} certain items can’t be created in a folder

dataflows gen2
streaming semantic models
streaming dataflows

⇐ items created from the home page or the Create hub, are created at the root level of the workspace [1]

{operation} move file(s) between folders [1]
{operation} publish to folder [1]

Power BI reports can be published to specific folders

{restriction} folders' name must be unique throughout an entire workspace, regardless of their location [1]

when publishing a report to a workspace that has another report with the same name in a different folder, the report will publish to the location of the already existing report [1]

{limitation}may not be supported by certain features

e.g. Git

{recommendation} use folders to organize workspaces [1]
{permissions}

inherit the permissions of the workspace where they're located [1] [2]
workspace admins, members, and contributors can create, modify, and delete folders in the workspace [1]
viewers can only view folder hierarchy and navigate in the workspace [1]

[deployment pipelines] deploying items in folders to a different stage, the folder hierarchy is automatically applied [2]

Previous Post <<||>> Next Post

References:
[1] Microsoft Fabric (2024) Create folders in workspaces [link]
[2] Microsoft Fabric (2024) The deployment pipelines process [link]
[3] Microsoft Fabric Updates Blog (2025) Define security on folders within a shortcut using OneLake data access roles [link]
[4] Microsoft Fabric Updates Blog (2025) Announcing the General Availability of Folder in Workspace [link]
[5] Microsoft Fabric Updates Blog (2025) Announcing Folder in Workspace in Public Preview [link]
[6] Microsoft Fabric Updates Blog (2025) Getting the size of OneLake data items or folders [link]

Resources:

[R1] Microsoft Learn (2025) Fabric: What's new in Microsoft Fabric? [link]

20 January 2025

🏭🗒️Microsoft Fabric: [Azure] Service Principals (SPN) [Notes]

Disclaimer: This is work in progress intended to consolidate information from various sources for learning purposes. For the latest information please consult the documentation (see the links below)!

Last updated: 20-Jan-2025

[Azure] Service Principal (SPN)

{def} a non-human, application-based security identity used by applications or automation tools to access specific Azure resources [1]

can be assigned precise permissions, making them perfect for automated processes or background services

allows to minimize the risks of human error and identity-based vulnerabilities
supported in datasets, Gen1/Gen2 dataflows, datamarts [2]
authentication type

supported only by [2]

Azure Data Lake Storage
Azure Data Lake Storage Gen2
Azure Blob Storage
Azure Synapse Analytics
Azure SQL Database
Dataverse
SharePoint online

doesn’t support

SQL data source with Direct Query in datasets [2]

when registering a new application in Microsoft Entra ID, a SPN is automatically created for the app registration [4]

the access to resources is restricted by the roles assigned to the SPN

⇒ gives control over which resources can be accessed and at which level [4]

{recommendation} use SPN with automated tools [4]

rather than allowing them to sign in with a user identity [4]

{prerequisite} an active Microsoft Entra user account with sufficient permissions to

register an application with the tenant [4]
assign to the application a role in the Azure subscription [4]
⇐ requires Application.ReadWrite.All permission [4]

extended to support Fabric Data Warehouses [1]

{benefit} automation-friendly API Access

allows to create, update, read, and delete Warehouse items via Fabric REST APIs using service principals [1]
enables to automate repetitive tasks without relying on user credentials [1]

e.g. provisioning or managing warehouses
increases security by limiting human error

the warehouses thus created, will be displayed in the Workspace list view in Fabric UI, with the Owner name of the SPN [1]
applicable to users with administrator, member, or contributor workspace role [3]
minimizes risk

the warehouses created with delegated account or fixed identity (owner’s identity) will stop working when the owner leaves the organization [1]

Fabric requires the user to login every 30 days to ensure a valid token is provided for security reasons [1]

{benefit} seamless integration with Client Tools:

tools like SSMS can connect to the Fabric DWH using SPN [1]
SPN provides secure access for developers to

run COPY INTO

with and without firewall enabled storage [1]

run any T-SQL query programmatically on a schedule with ADF pipelines [1]

{benefit} granular access control

Warehouses can be shared with an SPN through the Fabric portal [1]

once shared, administrators can use T-SQL commands to assign specific permissions to SPN [1]

allows to control precisely which data and operations an SPN has access to [1]

GRANT SELECT ON <table name> TO <Service principal name>

warehouses' ownership can be changed from an SPN to user, and vice-versa [3]

{benefit} improved DevOps and CI/CD Integration

SPN can be used to automate the deployment and management of DWH resources [1]

⇐ ensures faster, more reliable deployment processes while maintaining strong security postures [1]

{limitation} default semantic models are not supported for SPN created warehouses [3]

⇒ features such as listing tables in dataset view, creating report from the default dataset don’t work [3]

{limitation} SPN for SQL analytics endpoints is not currently supported
{limitation} SPNs are currently not supported for COPY INTO error files [3]

⇐ Entra ID credentials are not supported as well [3]

{limitation} SPNs are not supported for GIT APIs. SPN support exists only for Deployment pipeline APIs [3]
monitoring tools

[DMV] sys.dm_exec_sessions.login_name column [3]
[Query Insights] queryinsights.exec_requests_history.login_name [3]
Query activity

submitter column in Fabric query activity [3]

Capacity metrics app:

compute usage for warehouse operations performed by SPN appears as the Client ID under the User column in Background operations drill through table [3]

Previous Post <<||>> Next Post

References:

[1] Microsoft Fabric Updates Blog (2024) Service principal support for Fabric Data Warehouse [link]

[2] Microsoft Fabric Learn (2024) Service principal support in Data Factory [link]

[3] Microsoft Fabric Learn (2024) Service principal in Fabric Data Warehouse [link]

[4] Microsoft Fabric Learn (2024) Register a Microsoft Entra app and create a service principal [link]

[5] Microsoft Fabric Updates Blog (2024) Announcing Service Principal support for Fabric APIs [link]

Resources:

[R1] Microsoft Learn (2025) Fabric: What's new in Microsoft Fabric? [link]

Acronyms:

ADF - Azure Data Factory

API - Application Programming Interface

CI/CD - Continuous Integration/Continuous Deployment

DMV - Dynamic Management View

DWH - Data Warehouse

SPN - service principal

SSMS - SQL Server Management Studio

17 January 2025

💎🏭SQL Reloaded: Microsoft Fabric's SQL Databases (Part VIII: Permissions) [new feature]

Data-based solutions usually target a set of users who (ideally) have restricted permissions to the functionality. Therefore, as part of the process are defined several personas that target different use cases, for which the permissions must be restricted accordingly.

In the simplest scenario the user must have access to the underlying objects for querying the data. Supposing that an Entra User was created already, the respective user must be given access also in the Fabric database (see [1], [2]). From database's main menu follow the path to assign read permissions:
Security >> Manage SQL Security >> (select role: db_datareader)

Manage SQL Security

Manage access >> Add >> (search for User)

Manage access

(select user) >> Share database >> (select additional permissions) >> Save

Manage additional permissions

The easiest way to test whether the permissions work before building the functionality is to login over SQL Server Management Studio (SSMS) and check the access using the Microsoft Entra MFA. Ideally, one should have a User's credentials that can be used only for testing purposes. After the above setup was done, the new User was able to access the data.

A second User can be created for testing with the maximum of permissions allowed on the SQL database side, which is useful for troubleshooting. Alternatively, one can use only one User for testing and assign or remove the permissions as needed by the test scenario.

It's a good idea to try to understand what's happening in the background. For example, the expectation was that for the Entra User created above also a SQL user is created, which doesn't seem to be the case, at least per current functionality available.

Before diving deeper, it's useful to retrieve User's details:

-- retrieve current user
SELECT SUser_Name() sys_user_name
, User_Id() user_id 
, USER_NAME() user_name
, current_user [current_user]
, user [user];

Output:

sys_user_name	user_id	user_name	current_user	user
JamesClavell@[domain].onmicrosoft.com	0	JamesClavell@[domain].onmicrosoft.com	JamesClavell@[domain].onmicrosoft.com	JamesClavell@[domain].onmicrosoft.com

Retrieving the current User is useful especially when testing in parallel functionality with different Users. Strangely, User's ID is 0 when only read permissions were assigned. However, a valid User identifier is added for example when to the User is assigned also the db_datawriter role. Removing afterwards the db_datawriter role to the User keeps as expected User's ID. For troubleshooting purposes, at least per current functionality, it might be a good idea to create the Users with a valid User ID (e.g. by assigning temporarily the db_datawriter role to the User).

The next step is to look at the Users with access to the database:

-- database access 
SELECT USR.uid
, USR.name
--, USR.sid 
, USR.hasdbaccess 
, USR.islogin
, USR.issqluser
--, USR.createdate 
--, USR.updatedate 
FROM sys.sysusers USR
WHERE USR.hasdbaccess = 1
  AND USR.islogin = 1
ORDER BY uid

Output:

uid	name	hasdbaccess	islogin	issqluser
1	dbo	1	1	1
6	CharlesDickens@[...].onmicrosoft.com	1	1	0
7	TestUser	1	1	1
9	JamesClavell@[...].onmicrosoft.com	1	1	0

For testing purposes, besides the standard dbo role and two Entra-based roles, it was created also a SQL role to which was granted access to the SalesLT schema (see initial post):

-- create the user
CREATE USER TestUser WITHOUT LOGIN;

-- assign access to SalesLT schema 
GRANT SELECT ON SCHEMA::SalesLT TO TestUser;
  
-- test impersonation (run together)
EXECUTE AS USER = 'TestUser';

SELECT * FROM SalesLT.Customer;

REVERT;

Notes:
1) Strangely, even if access was given explicitly only to the SalesLT schema, the TestUser User has access also to sys.sysusers and other DMVs. That's valid also for the access over SSMS
2) For the above created User there are no records in the sys.user_token and sys.login_token DMVs, in contrast with the user(s) created for administering the SQL database.

Let's look at the permissions granted explicitly:

-- permissions granted explicitly
SELECT DPR.principal_id
, DPR.name
, DPR.type_desc
, DPR.authentication_type_desc
, DPE.state_desc
, DPE.permission_name
FROM sys.database_principals DPR
     JOIN sys.database_permissions DPE
	   ON DPR.principal_id = DPE.grantee_principal_id
WHERE DPR.principal_id != 0 -- removing the public user
ORDER BY DPR.principal_id
, DPE.permission_name;

Result:

principal_id	name	type_desc	authentication_type_desc	state_desc	permission_name
1	dbo	SQL_USER	INSTANCE	GRANT	CONNECT
6	CharlesDickens@[...].onmicrosoft.com	EXTERNAL_USER	EXTERNAL	GRANT	AUTHENTICATE
6	CharlesDickens@[...].onmicrosoft.com	EXTERNAL_USER	EXTERNAL	GRANT	CONNECT
7	TestUser	SQL_USER	NONE	GRANT	CONNECT
7	TestUser	SQL_USER	NONE	GRANT	SELECT
9	JamesClavell@[...].onmicrosoft.com	EXTERNAL_USER	EXTERNAL	GRANT	CONNECT

During troubleshooting it might be useful to check current user's permissions at the various levels via sys.fn_my_permissions:

-- retrieve database-scoped permissions for current user
SELECT *
FROM sys.fn_my_permissions(NULL, 'Database');

-- retrieve schema-scoped permissions for current user
SELECT *
FROM sys.fn_my_permissions('SalesLT', 'Schema');

-- retrieve object-scoped permissions for current user
SELECT *
FROM sys.fn_my_permissions('SalesLT.Customer', 'Object')
WHERE permission_name = 'SELECT';

Notes:
1) See also [1] and [4] in what concerns the limitations that apply to managing permissions in SQL databases.

Happy coding!

Previous Post <<||>> Next Post

References:
[1] Microsoft Learn (2024) Microsoft Fabric: Share your SQL database and manage permissions [link]
[2] Microsoft Learn (2024) Microsoft Fabric: Share data and manage access to your SQL database in Microsoft Fabric [link]
[3] Microsoft Learn (2024) Authorization in SQL database in Microsoft Fabric [link]
[4] Microsoft Learn (2024) Authentication in SQL database in Microsoft Fabric [link]

[5] Microsoft Fabric Learn (2025) Manage access for SQL databases in Microsoft Fabric with workspace roles and item permissions [link]

08 December 2024

🏭🗒️Microsoft Fabric: Shortcuts [Notes]

Disclaimer: This is work in progress intended to consolidate information from various sources for learning purposes. For the latest information please consult the documentation (see the links below)!

Last updated: 29-May-2025

[Microsoft Fabric] Shortcut

{def} object that points to other internal or external storage location (aka shortcut) [1] and that can be used for data access

serves as virtual pointer to data stored in other locations [6]
{goal} unifies existing data without copying or moving it [2]

⇒ data can be used multiple times without being duplicated [2]
{benefit} helps to eliminate edge copies of data [1]
{benefit} reduces process latency associated with data copies and staging [1]

is a mechanism that allows to unify data across domains, clouds, and accounts through a namespace [1]

⇒ allows creating a single virtual data lake for the entire enterprise [1]
⇐ available in all Fabric experiences [1]
⇐ behave like symbolic links [1]

independent object from the target [1]
appear as folder [1]
can be used by workloads or services that have access to OneLake [1]
transparent to any service accessing data through the OneLake API [1]

can point to

OneLake locations
ADLS Gen2 storage accounts
Amazon S3 storage accounts
Dataverse
on-premises or network-restricted locations via PDF

{capability} create shortcut to consolidate data across artifacts or workspaces, without changing data's ownership [2]
{capability} data can be compose throughout OneLake without any data movement [2]
{capability} allow instant linking of data already existing in Azure and in other clouds, without any data duplication and movement [2]

⇐ makes OneLake the first multi-cloud data lake [2]

{capability} provides support for industry standard APIs

⇒ OneLake data can be directly accessed via shortcuts by any application or service [2]

{operation} creating a shortcut

can be created in

lakehouses
KQL databases

⇐ shortcuts are recognized as external tables [1]

can be created via

Fabric UI
REST API

can be created across items [1]

the item types don't need to match [1]

e.g. create a shortcut in a lakehouse that points to data in a data warehouse [1]

[lakehouse] tables folder

represents the managed portion of the lakehouse

shortcuts can be created only at the top level [1]

⇒ shortcuts aren't supported in other subdirectories [1]

if shortcut's target contains data in the Delta\Parquet format, the lakehouse automatically synchronizes the metadata and recognizes the folder as a table [1]

[lakehouse] files folder

represents the unmanaged portion of the lakehouse [1]
there are no restrictions on where shortcuts can be created [1]

⇒ can be created at any level of the folder hierarchy [1]
⇐ table discovery doesn't happen in the Files folder [1]

[lakehouse] all shortcuts are accessed in a delegated mode when querying through the SQL analytics endpoint [5]

the delegated identity is the Fabric user that owns the lakehouse [5]

{default} the owner is the user that created the lakehouse and SQL analytics endpoint [5]

⇐ can be changed in select cases
the current owner is displayed in the Owner column in Fabric when viewing the item in the workspace item list

⇒ the querying user is able to read from shortcut tables if the owner has access to the underlying data, not the user executing the query [5]

⇐ the querying user only needs access to select from the shortcut table [5]

{feature} OneLake data access roles

{enabled} access to a shortcut is determined by whether the SQL analytics endpoint owner has access to see the target lakehouse and read the table through a OneLake data access role [5]
{disabled} shortcut access is determined by whether the SQL analytics endpoint owner has the Read and ReadAll permission on the target path [5]

{operation} renaming a shortcut
{operation} moving a shortcut
{operation} deleting a shortcut

doesn't affect the target [1]

⇐ only the shortcut object is deleted [1]

shortcuts don't perform cascading deletes [1]
moving, renaming, or deleting a target path can break the shortcut [1]

{operation} delete file/folder

file or folder within a shortcut can be deleted when the permissions in the shortcut target allows it [1]

{permissions} users must have permissions in the target location to read the data [1]

when a user accesses data through a shortcut to another OneLake location, the identity of the calling user is used to authorize access to the data in the target path of the shortcut [1] (aka passthrough auth model [6])

ensures that any user accessing the shortcut is only able to see whatever they have access to in the target [6]
the security from the target ‘flows across’ the shortcut to restrict access in the source lakehouse [6]
OneLake to OneLake shortcuts support only passthrough mode [6]

ensures that the source system retains full control over its data [6]

⇐ there’s no need to replicate or redefine access controls for the shortcut [6]
{benefit} reduces administrative overhead since security policies only need to be maintained in one place [6]
{constraint} security cannot be modified directly from the downstream item [6]

ensures that the source system retains full control over its data [6]

any changes to access permissions must be made at the source location [6]
the source remains the single point of truth for access control [6]

⇐ ensures consistency
⇐ minimes the risk of misconfiguration [6]

{type} delegated auth mode

shortcuts access data by using some intermediate credential

e.g. another user or an account key
allow for permission management to be separated or ‘delegated’ to another team or downstream user to manage [6]

always break the flow of security from one system to another [6]
all delegated shortcuts in OneLake can have OneLake security roles defined for them [6]

all shortcuts from OneLake to external systems are delegated [6]

e.g. AWS S3 or Google Cloud Storage
allows users to connect to the external system without being given direct access [6]
OneLake security can then be configured on the shortcut to limit what data in the external system can be accessed [6]

when accessing shortcuts through Power BI semantic models or T-SQL, the calling user’s identity is not passed through to the shortcut target [1]

the calling item owner’s identity is passed instead, delegating access to the calling user [1]

OneLake manages all permissions and credentials

{type} OneLake to OneLake shortcuts

ideal for ensuring the hub retains control over sensitive or regulated data [6]

each downstream team

can then only consume the data they are allowed to [6]
has the freedom to create its own reports or combine the hub data with other data that they own [6]

{concept} hub-and-spoke model

allows to manage the data access across multiple teams or departments [6]
{component} hub

the central data repository where core datasets are stored [6]
security policies are meticulously defined to ensure robust control [6]

{component} spokes

individual teams or departments access the hub’s data through shortcuts [6]

{advantage} enables centralized governance while allowing decentralized consumption and use of data [6]
can be leveraged in various ways to create efficient and secure data architectures [6]

{type} delegated shortcuts

allow to share data securely centralize data across clouds, without copying it [6]

the data that already exists in various cloud storage accounts is consolidated in OneLake through the use of delegated shortcuts [6]
a new lakehouse is created as the consolidation point [6]
each external data source is connected via a delegated shortcut [6]

the admin can define OneLake security roles to govern access
granularity: row, column, schemas or shortcuts [6]

⇒ no user will have direct access to the external data ⇐ they will be limited to only what the admin allows through OneLake security [6]
⇐ once the data is consolidated, it can be combined with the hub-and-spoke model to create a composite architecture that keeps both upstream and downstream data safe [6]

{feature} shortcut caching

{def} mechanism used to reduce egress costs associated with cross-cloud data access [1]

when files are read through an external shortcut, the files are stored in a cache for the Fabric workspace [1]

subsequent read requests are served from cache rather than the remote storage provider [1]
cached files have a retention period of 24 hours
each time the file is accessed the retention period is reset [1]
if the file in remote storage provider is more recent than the file in the cache, the request is served from remote storage provider and the updated file will be stored in cache [1]
if a file hasn’t been accessed for more than 24hrs it is purged from the cache [1]

{restriction} individual files greater than 1 GB in size are not cached [1]
{restriction} only GCS, S3 and S3 compatible shortcuts are supported [1]

{feature} query acceleration

caches data as it lands in OneLake, providing performance comparable to ingesting data in Eventhouse [4]

{limitation} maximum number of shortcuts [1]

per Fabric item: 100,000
in a single OneLake path: 10
direct shortcuts to shortcut links: 5

{limitation} ADLS and S3 shortcut target paths can't contain any reserved characters from RFC 3986 section 2.2 [1]
{limitation} shortcut names, parent paths, and target paths can't contain "%" or "+" characters [1]
{limitation} shortcuts don't support non-Latin characters[1]
{limitation} Copy Blob API not supported for ADLS or S3 shortcuts[1]
{limitation} copy function doesn't work on shortcuts that directly point to ADLS containers

{recommended} create ADLS shortcuts to a directory that is at least one level below a container [1]

{limitation} additional shortcuts can't be created inside ADLS or S3 shortcuts [1]
{limitation} lineage for shortcuts to Data Warehouses and Semantic Models is not currently available[1]
{limitation} it may take up to a minute for the Table API to recognize new shortcuts [1]
introduce unique considerations when it comes to security [6]

Previous Post <<||>> Next Post

References:
[1] Microsoft Learn (2024) Fabric: OneLake shortcuts [link]
[2] Microsoft Learn (2024) Fabric Analyst in a Day [course notes]

[3] Microsoft Learn (2024) Use OneLake shortcuts to access data across capacities: Even when the producing capacity is paused! [link]

[4] Microsoft Learn (2024) Fabric: Query acceleration for OneLake shortcuts - overview (preview) [link]

[5] Microsoft Learn (2024) Microsoft Fabric: How to secure a lakehouse for Data Warehousing teams [link]

[6] Microsoft Fabric Update Blog (2025) Understanding OneLake Security with Shortcuts [link]

Acronyms:

ADLS - Azure Data Lake Storage
API - Application Programming Interface

AWS - Amazon Web Services

GCS - Google Cloud Storage

KQL - Kusto Query Language

OPDG - on-premises data gateway

30 October 2018

💠🛠️SQL Server: Administration (Troubleshooting Login Failed for User)

    Since the installation of an SQL Server 2017 on a virtual machine (VM) in the Microsoft Cloud started to appear in the error log records with the following message:

Login failed for user '<domain>\<computer>$'. Reason: Could not find a login matching the name provided. [CLIENT: <local machine>]
Error: 18456, Severity: 14, State: 5.

   From the text it seemed like a permission problem, thing confirmed by the documentation (see [1]), the Error Number and State correspond to a „User Id is not valid“ situation. In a first step I attempted to give permissions to the local account (dollar sign included). The account wasn’t found in the Active Directory (AD), though by typing the account directly in the “Login name” I managed to give temporarily sysadmin permission to the account. The error continued to appear in the error log. I looked then at the accounts under which the SQL Services run - nothing suspect in there.

   Except the error message, which was appearing with an alarming frequency (a few seconds apart), everything seemed to be working on the server. The volume of records (a few hundred thousands over a few days) bloating the error log, as well the fact that I didn’t knew what’s going on made me take the time and further investigate the issue.

Looking today at the Windows Logs for Applications I observed that the error is caused by an account used for the Microsoft SQL Server IaaS Agent and IaaS Query Service. Once I gave permissions to the account the error disappeared.

   The search for a best practice on what permissions to give to the IaaS Agent and IaaS Query Service lead me to [2]. To quote, the “Agent Service needs Local System rights to be able to install and configure SQL Server, attach disks and enable storage pool and manage automated security patching of Windows and SQL server”, while the “IaaS Query Service is started with an NT Service account which is a Sys Admin on the SQL Server”. In fact, this was the only resource I found that made a reference to the IaaS Query Service.

   This was just one of the many scenarios in which the above error appears. For more information see for example [3], [4] or [5].

References:
[1] Microsoft (2017) MSSQLSERVER_18456 [Online] Available from: https://docs.microsoft.com/en-us/sql/relational-databases/errors-events/mssqlserver-18456-database-engine-error?view=sql-server-2017
[2] SQL Database Engine Blog (2018) SQL Server IaaS Extension Query Service for SQL Server on Azure VM, by Mine Tokus Altug [Online] Available from: https://blogs.msdn.microsoft.com/sqlserverstorageengine/2018/10/25/sql-server-iaas-extension-query-service-for-sql-server-on-azure-vm/
[3] Microsoft Support (2018) "Login failed for user" error message when you log on to SQL Server [Online] Available from: https://support.microsoft.com/en-sg/help/555332/login-failed-for-user-error-message-when-you-log-on-to-sql-server
[4] Microsoft Technet (2018) How to Troubleshoot Connecting to the SQL Server Database [Online] Available from: Engine https://social.technet.microsoft.com/wiki/contents/articles/2102.how-to-troubleshoot-connecting-to-the-sql-server-database-engine.aspx
[5] Microsoft Blogs (2011)Troubleshoot Connectivity/Login failures (18456 State x) with SQL Server, by Sakthivel Chidambaram [Online] Available from: https://blogs.msdn.microsoft.com/sqlsakthi/2011/02/06/troubleshoot-connectivitylogin-failures-18456-state-x-with-sql-server/

18 June 2017

💠🛠️SQL Server: Administration (Database Recovery on SQL Server 2017)

I installed today SQL Server 2017 CTP 2.1 on my Lab PC without any apparent problems. It was time to recreate some of the databases I used for testing. As previously I had an evaluation version of SQL Server 2016, it expired without having a backup for one of the databases. I could recreate the database from scripts and reload the data from various text files. This would have been a relatively laborious task (estimated time > 1 hour), though the chances were pretty high that everything would go smoothly. As the database is relatively small (about 2 GB) and possible data loss was neglectable, I thought it would be possible to recover the data from the database with minimal loss in less than half of hour. I knew this was possible, as I was forced a few times in the past to recover data from damaged databases in SQL Server 2005, 2008 and 2012 environments, though being in a new environment I wasn’t sure how smooth will go and how long it would take.

Plan A - Create the database with ATTACH_REBUILD_LOG option:

As it seems the option is available in SQL Server 2017, so I attempted to create the database via the following script:

CREATE DATABASE  ON 
(FILENAME='I:\Data\.mdf') 
FOR ATTACH_REBUILD_LOG

And as expected I run into the first error:

Msg 5120, Level 16, State 101, Line 1 Unable to open the physical file "I:\Data\.mdf". Operating system error 5: "5(Access is denied.)".

Msg 1802, Level 16, State 7, Line 1 CREATE DATABASE failed. Some file names listed could not be created. Check related errors.

It looked like a permissions problem, though I wasn’t entirely sure which account is causing the problem. In the past I had problems with the Administrator account, so it was the first thing to try. Once I removed the permissions for Administrator account to the folder containing the database and gave it full control permissions again, I tried to create the database anew using the above script, running into the next error:

File activation failure. The physical file name "D:\Logs\_log.ldf" may be incorrect. The log cannot be rebuilt because there were open transactions/users when the database was shutdown, no checkpoint occurred to the database, or the database was read-only. This error could occur if the transaction log file was manually deleted or lost due to a hardware or environment failure.

Msg 1813, Level 16, State 2, Line 1 Could not open new database ''. CREATE DATABASE is aborted.

This approach seemed to lead nowhere, so it was time for Plan B.

Plan B - Recover the database into an empty database with the same name:

Step 1: Create a new database with the same name, stop the SQL Server, then copy the old file over the new file, and delete the new log file manually. Then restarted the server. After the restart the database will appear in Management Studio with the SUSPECT state.

Step 2: Set the database in EMERGENCY mode:

ALTER DATABASE  SET EMERGENCY, SINGLE_USER

Step 3: Rebuild the log file:

ALTER DATABASE <database_name>

REBUILD LOG ON (Name=’_Log',

FileName='D:\Logs\.ldf')

The rebuild worked without problems.

Step 4: Set the database in MULTI_USER mode:

ALTER DATABASE  SET MULTI_USER

Step 5: Perform a consistency check:

DBCC CHECKDB () WITH ALL_ERRORMSGS, NO_INFOMSG

After 15 minutes of work the database was back online.

Warnings:
Always attempt to recover the data for production databases from the backup files! Use the above steps only if there is no other alternative!
The consistency check might return errors. In this case one might need to run CHECKDB with REPAIR_ALLOW_DATA_LOSS several times [2], until the database was repaired.
After recovery there can be problems with the user access. It might be needed to delete the users from the recovered database and reassign their permissions!

Resources:
[1] In Recovery (2008) Creating, detaching, re-attaching, and fixing a SUSPECT database, by Paul S Randal [Online] Available from: https://www.sqlskills.com/blogs/paul/creating-detaching-re-attaching-and-fixing-a-suspect-database/
[2] In Recovery (2009) Misconceptions around database repair, by Paul S Randal [Online] Available from: https://www.sqlskills.com/blogs/paul/misconceptions-around-database-repair/
[3] Microsoft Blogs (2013) Recovering from Log File Corruption, by Glen Small [Online] Available from: https://blogs.msdn.microsoft.com/glsmall/2013/11/14/recovering-from-log-file-corruption/

SQL Troubles

Pages

13 March 2025

🏭🗒️Microsoft Fabric: Workspaces [Notes]

11 March 2025

🏭🎗️🗒️Microsoft Fabric: Real-Time Dashboards (RTD) [Notes]

22 January 2025

🏭🗒️Microsoft Fabric: Clone Tables in Warehouses [Notes]

🏭🗒️Microsoft Fabric: Folders [Notes]

20 January 2025

🏭🗒️Microsoft Fabric: [Azure] Service Principals (SPN) [Notes]

17 January 2025

💎🏭SQL Reloaded: Microsoft Fabric's SQL Databases (Part VIII: Permissions) [new feature]

08 December 2024

🏭🗒️Microsoft Fabric: Shortcuts [Notes]

30 October 2018

💠🛠️SQL Server: Administration (Troubleshooting Login Failed for User)

18 June 2017

💠🛠️SQL Server: Administration (Database Recovery on SQL Server 2017)

About Me