How to use distinct with group by in Linq to SQL

Posted 2019-06-15 13:17

I'm trying to convert the following SQL to LINQ to SQL:

select groupId, count(distinct(userId)) from processroundissueinstance 
group by groupId

Here is my code:

var q = from i in ProcessRoundIssueInstance
    group i by i.GroupID into g
    select new
    {
        Key = g.Key,
        Count = g.Select(x => x.UserID).Distinct().Count()
    };

When I run the code, I keep getting an "Invalid GroupID" error. Any ideas? It seems the Distinct() call is what breaks it.

Here is the generated sql:

SELECT [t1].[GroupID] AS [Key], (
SELECT COUNT(*)
FROM (
    SELECT DISTINCT [t2].[UserID]
    FROM [ProcessRoundIssueInstance] AS [t2]
    WHERE (([t1].[GroupID] IS NULL) AND ([t2].[GroupID] IS NULL)) 
       OR (([t1].[GroupID] IS NOT NULL) 
            AND ([t2].[GroupID] IS NOT NULL) 
            AND ([t1].[GroupID] = [t2].[GroupID]))
    ) AS [t3]
) AS [Count]
FROM (
    SELECT [t0].[GroupID]
    FROM [ProcessRoundIssueInstance] AS [t0]
    GROUP BY [t0].[GroupID]
    ) AS [t1]

5 answers
老娘就宠你
#2 · 2019-06-15 13:37

According to this post, your code looks correct:

LINQ to SQL using GROUP BY and COUNT(DISTINCT)

Have you tried inspecting the SQL that is generated?

贼婆χ
#3 · 2019-06-15 13:38

There appears to be a whole bunch of goop in the generated SQL to deal with GroupID being NULL. Is that a possibility? If not, try changing the column definition to make it NOT NULL.
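One way to separate a data problem from a provider problem is to run the same query shape over in-memory data with LINQ to Objects. The following is only a sketch under assumptions: the `IssueInstance` class, the `DistinctCountDemo` helper, and the column types (string `GroupID`, int `UserID`) are hypothetical stand-ins, since the question does not show the real entity.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical in-memory stand-in for one ProcessRoundIssueInstance row.
// The column types are assumptions; the question does not state them.
public class IssueInstance
{
    public string GroupID { get; set; }
    public int UserID { get; set; }
}

public static class DistinctCountDemo
{
    // Same query shape as in the question, evaluated by LINQ to Objects
    // instead of being translated to SQL.
    public static Dictionary<string, int> DistinctUsersPerGroup(IEnumerable<IssueInstance> rows)
    {
        return rows
            .Where(i => i.GroupID != null)   // drop NULL keys before grouping
            .GroupBy(i => i.GroupID)
            .ToDictionary(
                g => g.Key,
                g => g.Select(x => x.UserID).Distinct().Count());
    }

    public static void Main()
    {
        var rows = new List<IssueInstance>
        {
            new IssueInstance { GroupID = "A", UserID = 1 },
            new IssueInstance { GroupID = "A", UserID = 1 },  // duplicate user in group A
            new IssueInstance { GroupID = "A", UserID = 2 },
            new IssueInstance { GroupID = "B", UserID = 3 },
            new IssueInstance { GroupID = null, UserID = 4 }, // NULL key, filtered out
        };

        // Group A has 2 distinct users and group B has 1.
        foreach (var pair in DistinctUsersPerGroup(rows))
        {
            Console.WriteLine(pair.Key + ": " + pair.Value);
        }
    }
}
```

If the in-memory result matches what the hand-written SQL returns for the same rows, the query shape itself is probably fine, and the data or the LINQ to SQL mapping is the more likely culprit.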

趁早两清
#4 · 2019-06-15 13:42

Are you certain the database's integrity is intact? In any case, maybe you should try this. I don't know how a group would be empty, but that seems to be your problem.

ProcessRoundIssueInstance.Where(i => i.GroupID != null)
    .GroupBy(i => i.GroupID)
    .Select(g => new
    {
        Key = g.Key,
        // A group produced by GroupBy always has at least one element,
        // so no empty-group check is needed here.
        Count = g.Select(item => item.UserID).Distinct().Count()
    });
beautiful°
#5 · 2019-06-15 13:45

Try a where clause to filter out spurious IDs before grouping:

var q = from i in ProcessRoundIssueInstance
    where i.GroupID != ""
    group i by i.GroupID into g
    select new
    {
        Key = g.Key,
        Count = g.Select(x => x.UserID).Distinct().Count()
    };
狗以群分
#6 · 2019-06-15 13:48

I think Basiclife is close, but checking whether the ID is empty may not be the issue, or may not be enough. Since you said it is a nullable field, you should also make sure it is not null before grouping. Otherwise the query looks right. If you still have issues, you may have bad data, or you may have hit a bug or a not fully implemented feature of LINQ to SQL, in which case you may want to try LINQ to Entities.

var q = from i in ProcessRoundIssueInstance
        where i.GroupID != null
        && i.GroupID != string.Empty
        group i by i.GroupID into g        
        select new
        {
            Key = g.Key,
            Count = g.Select(x => x.UserID).Distinct().Count()
        };