Can I use a single Linux OS to schedule two DSU clusters?

embedded2023 over 2 years ago

I have an SoC made of two DynamIQ Shared Unit (DSU) clusters, Cluster0: 4xA76+4xA55, Cluster2: 4xA55. Each of them has its own L3 and connects to a NoC. The typical use case of this SoC is to run two OSes on the two clusters, e.g. Android on Cluster1 and Linux on Cluster2. They communicate through shared memory and a mailbox mechanism to accommodate specific applications such as IVI or service robots.

I want to know whether it is possible to run both clusters under the same Linux OS, so that more generic application scenarios can be supported without the complexity of two OSes or wasting the computing resources of the second 'little' cluster.

NUMA looks like something close, but I'm not sure whether it is feasible in this scenario.

Martin Weidmann over 2 years ago
Whether you can run one instance across the clusters, and how easy it is to do so, depends a lot on how the SoC is laid out. If the SoC was intended to run two different OS instances (one per cluster), there could be design decisions that make it impractical.

A few starting questions:

Do the cores in both clusters have the same set of arch options? I know you said it's A76+A55 on one and A55 on the other, but it's worth checking that there are no differences between the A55s in the two clusters. At the risk of over-generalising, OSs don't like running on cores which aren't architecturally the same.

Are the two clusters cache coherent with each other and does the NoC support DVM traffic between the clusters? If the answer to either is "no", then it's going to be harder and probably less performant.

Do the two clusters share a GIC (interrupt controller) or is there a separate GIC per-cluster? If it's separate GICs per-cluster that again will make it harder.
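As a rough first pass at the first question, one could capture `/proc/cpuinfo` from each of the two Linux-based OS instances and compare what the A55 cores report. The sketch below is only an illustration: the function names are hypothetical, the default part number `0xd05` is my understanding of the Cortex-A55 part number, and the `Features` line only shows what the kernel advertises to user space, so a match here is necessary rather than sufficient. The ID registers themselves would still need checking.

```python
def parse_cpuinfo(text):
    """Parse /proc/cpuinfo text into {cpu_index: {"part", "revision", "features"}}."""
    cpus = {}
    current = None
    for line in text.splitlines():
        if ":" not in line:
            continue
        key, value = (s.strip() for s in line.split(":", 1))
        if key == "processor":
            current = int(value)
            cpus[current] = {"part": None, "revision": None, "features": frozenset()}
        elif current is not None:
            if key == "CPU part":
                cpus[current]["part"] = value.lower()
            elif key == "CPU revision":
                cpus[current]["revision"] = value
            elif key == "Features":
                cpus[current]["features"] = frozenset(value.split())
    return cpus


def feature_mismatches(dump_a, dump_b, part="0xd05"):
    """Compare the advertised features of cores with the given part number.

    Returns (only_in_a, only_in_b) as sorted lists of feature names.
    The default part number is an assumption (Cortex-A55).
    """
    feats_a = frozenset().union(
        *(c["features"] for c in parse_cpuinfo(dump_a).values() if c["part"] == part)
    )
    feats_b = frozenset().union(
        *(c["features"] for c in parse_cpuinfo(dump_b).values() if c["part"] == part)
    )
    return sorted(feats_a - feats_b), sorted(feats_b - feats_a)
```

Two empty lists mean the A55s advertise the same features in both dumps; anything else is a difference a single kernel instance would have to hide or cope with.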
embedded2023 over 2 years ago in reply to Martin Weidmann

Yes, the SoC is designed to run two OSes.

1. The L1 and L2 caches are the same; the L3 size differs between the two clusters (L3 is part of the DSU, not per core). The A55s in the 'small' cluster do not have TrustZone. I'm not aware of other differences.

2. The two clusters (or OSes) are designed to communicate over shared memory and a mailbox (FIFO). The shared memory is about 1MB and the mailbox holds only a few bytes. By default the two OSes communicate over RPMsg.

3. They each have their own GIC, but both are assigned the same PPI/SPI interrupts, and all peripherals can be routed to either cluster.
Martin Weidmann over 2 years ago in reply to embedded2023

I'm not a kernel developer, but given what you've said I suspect it will be an uphill battle.

Taking the GIC first, an OS is typically going to expect one GIC shared by all the cores running the same instance of the OS. That's the assumption most standard drivers I've seen operate on. I think you could work around it, but it could be quite complicated. One example of why: in the GIC, an interrupt can only be Acknowledged by the core it is targeted at, but it can be Deactivated from any core. Things like threaded interrupt handlers make use of that property, but it won't work if the Ack'ing core and the Deactivating core are connected to different GICs. SW would have to know that was a possibility, test for it, and then deal with it in an SoC-specific way.
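To make the Acknowledge/Deactivate problem concrete, here is a deliberately simplified toy model. It is not the GIC programming interface: `ToyGic`, `run_threaded_irq`, and the core numbering are all hypothetical and exist only to show what happens when the deactivating core sits behind a different interrupt controller from the acknowledging core.

```python
class ToyGic:
    """Toy interrupt controller: tracks which interrupt IDs are active."""

    def __init__(self, name):
        self.name = name
        self.active = set()  # acknowledged but not yet deactivated

    def acknowledge(self, intid):
        self.active.add(intid)

    def deactivate(self, intid):
        # A controller can only clear state it holds itself.
        self.active.discard(intid)


def run_threaded_irq(gic_of, ack_core, thread_core, intid):
    """Acknowledge on ack_core, deactivate later from thread_core.

    gic_of maps a core number to the ToyGic that core is connected to.
    Returns True if the interrupt ends up inactive on the acknowledging GIC.
    """
    ack_gic = gic_of[ack_core]
    ack_gic.acknowledge(intid)
    gic_of[thread_core].deactivate(intid)
    return intid not in ack_gic.active


# One GIC shared by all 12 cores: the case standard drivers assume.
shared = ToyGic("shared")
one_gic = {core: shared for core in range(12)}

# One GIC per cluster (hypothetical numbering: cores 0-7 and cores 8-11).
gic_a, gic_b = ToyGic("gic_a"), ToyGic("gic_b")
two_gics = {core: (gic_a if core < 8 else gic_b) for core in range(12)}

shared_ok = run_threaded_irq(one_gic, ack_core=0, thread_core=9, intid=42)
split_ok = run_threaded_irq(two_gics, ack_core=0, thread_core=9, intid=42)
```

In the shared case the interrupt is cleanly deactivated; in the split case it stays active on `gic_a`, which is the situation the software would have to detect and handle in an SoC-specific way.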

The memory system also sounds like it's going to be challenging. If I've understood you correctly, most of the memory each core can see is not visible to the other cluster. It's only a small portion of memory which is visible to both. You could limit the allocator to just using the shared memory, but as it's only 1MB that's not very useful. Using the non-shared memory is going to be a real challenge, as the kernel would need to know where the process was running to decide which memory to allocate. Something like that can happen already, but that's usually for performance reasons not functional reasons.
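The allocation constraint can be sketched in the same toy fashion. Everything here is an assumption for illustration: the `Region` type, the region names, and the 2 GiB private sizes are invented, and only the roughly 1MB shared window comes from the description above. The point is that a task allowed to migrate between clusters can only use memory visible to every cluster it may run on.

```python
from dataclasses import dataclass

MIB = 1024 * 1024


@dataclass(frozen=True)
class Region:
    name: str
    size: int
    visible_to: frozenset  # names of the clusters that can access this region


def usable_regions(regions, clusters):
    """Return the regions that every cluster in `clusters` can access."""
    wanted = set(clusters)
    return [r for r in regions if wanted <= r.visible_to]


# Hypothetical memory map; private sizes are placeholders.
regions = [
    Region("big_private", 2048 * MIB, frozenset({"big"})),
    Region("little_private", 2048 * MIB, frozenset({"little"})),
    Region("shared", 1 * MIB, frozenset({"big", "little"})),
]

pinned = usable_regions(regions, ["big"])              # task pinned to one cluster
migratable = usable_regions(regions, ["big", "little"])  # task free to migrate
migratable_bytes = sum(r.size for r in migratable)
```

A task pinned to one cluster can use that cluster's private memory plus the shared window, while a freely migrating task is left with the shared window alone. Unlike NUMA, where a remote node is slower but still reachable, picking the wrong region here is a functional failure.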