Print NBA MVP list in rust

Print NBA MVP list in rust - sorting

I was reading https://www.nba.com/history/awards/mvp.
And I tried to print out the name and count in descending order.
Kareem Abdul-Jabbar: 6
Bill Russell: 5
Michael Jordan: 5
LeBron James: 4
Wilt Chamberlain: 4
Magic Johnson: 3
...
As a beginner, I managed to do it like below.
I used a HashMap to count by name and a Vec to sort by value.
But I think there might be a better solution.
I will appreciate any advice to improve this.
use std::collections::HashMap;
const MVPS: &str = concat!(
"2017-18 James Harden, Houston Rockets (Voting)\n",
"2016-17 Russell Westbrook, Oklahoma City Thunder (Voting)\n",
... omitted for brevity
);
fn main() {
let lines: Vec<&str> = MVPS.split('\n')
.map(|line| &line[12..line.find(',').unwrap_or(line.len())])
.collect();
let mut mvp_map = HashMap::new();
for name in &lines {
mvp_map.insert(name, mvp_map.get(name).unwrap_or(&0) + 1);
}
let mut mvp_entry_list = Vec::new();
for entry in &mvp_map {
mvp_entry_list.push(entry);
}
mvp_entry_list.sort_by(|a, b| (b.1).cmp(a.1));
for e in &mvp_entry_list {
println!("{}: {}", e.0, e.1);
}
}

Finally I got better one.
use itertools::Itertools;
const MVPS: &str = concat!(
"2017-18 James Harden, Houston Rockets (Voting)\n",
"2016-17 Russell Westbrook, Oklahoma City Thunder (Voting)\n",
... omitted for brevity
);
fn main() {
let mvps_count = MVPS.lines()
.map(|line| &line[12..line.find(',').unwrap_or(line.len())])
.into_iter()
.sorted()
.group_by(|name| *name)
.into_iter()
.map(|(key, group)| (key, group.count()))
.sorted_by(|a, b| (b.1).cmp(&a.1))
.inspect(|(key, count)| println!("{}: {}", key, count))
.map(|(_key, count)| count)
.fold(0 as usize, |a, b| a + b);
println!("{}", MVPS.lines().count() == mvps_count);
}

Related

Why does my Rust StackDFS implementation produce incorrect results? [closed]

Closed. This question needs debugging details. It is not currently accepting answers.
Edit the question to include desired behavior, a specific problem or error, and the shortest code necessary to reproduce the problem. This will help others answer the question.
Closed 1 year ago.
Improve this question
I am reading Luridas's book about algorithms and now I am at the Chapter 2 trying to implement StackDFS function. The purpose of this function is to "visit" every element of the graph.
Algorithm from the book:
StackDFS(G, node)→visited
//Input: G = (V,E), a graph
//node, the starting vertex in G
//Output: visited, an array of size |V | such that visited[i] is true if we
//have visited node i, false otherwise
1 S ←CreateStack()
2 visited ←CreateArray(|V |)
3 for i ←0 to |V | do
4 visited[i] ← false
5 Push(S, node)
6 while not IsStackEmpty(S) do
7 c ←Pop(s)
8 visited[c] ← true
9 foreach v in AdjacencyList(G, c) do
10 if not visited[v] then
11 Push(S, v)
12 return visited
My code:
use petgraph::graph::NodeIndex;
use petgraph::Undirected;
fn main() {
let mut new = petgraph::Graph::<i32, (i32, i32), Undirected>::new_undirected();
new.extend_with_edges(&[(0, 1), (0, 3), (1, 2), (2, 4)]);
let index = 2;
let mut visited = Vec::new();
for i in 0..new.node_count() {
visited.push(false);
}
StackDFS(&mut new, &index, &mut visited);
}
fn StackDFS<T>(G: &petgraph::Graph<T, (T, T), Undirected>, node: &usize, visited: &mut Vec<bool>) {
let mut s: Vec<usize> = Vec::with_capacity(G.node_count());
// for i in 0..G.node_count(){
// visited.push(false);
//}
s.push(*node);
while s.is_empty() == false {
let c = s.pop().unwrap();
visited[c] = true;
for el in G.neighbors(NodeIndex::new(*node)) {
if visited[el.index()] == false {
s.push(el.index());
}
}
}
println!("{:?}", visited);
}
The output I get:
[false, true, true, false, true]
The output I expect:
[true, true, true, true, true]
I used a Vec instead of a stack, but I don't think this is the problem because Vec can provide the same functionality as a stack.

for el in G.neighbors(NodeIndex::new(*node)) {
You are always accessing the neighbors of the same starting node. You are supposed to get the neighbors of the currently processed node.
Improved version:
use petgraph::{graph::NodeIndex, Undirected}; // 0.6.0
fn main() {
let mut new = petgraph::Graph::<i32, (i32, i32), Undirected>::new_undirected();
new.extend_with_edges(&[(0, 1), (0, 3), (1, 2), (2, 4)]);
let index = 2;
let mut visited = vec![false; new.node_count()];
stack_dfs(&new, index, &mut visited);
}
fn stack_dfs<T>(
g: &petgraph::Graph<T, (T, T), Undirected>,
start_node: usize,
visited: &mut Vec<bool>,
) {
let mut s = Vec::with_capacity(g.node_count());
s.push(start_node);
while let Some(node) = s.pop() {
visited[node] = true;
for el in g.neighbors(NodeIndex::new(node)) {
let idx = el.index();
if !visited[idx] {
s.push(idx);
}
}
}
println!("{:?}", visited);
}

How to concat vector elements and store it back to vector?

I have a vector of a vector and need to concatenate the second one to the first (it's ok if the second one is dropped), i.e.
f([[1,2,3], [4,5,6]]) => [[1,2,3,4,5,6], []]
or
f([[1,2,3], [4,5,6]]) => [[1,2,3,4,5,6], [4,5,6]]
Both are okay.
My initial solution is:
fn problem() {
let mut items = Vec::new();
items.push(vec![1,2,3]);
items.push(vec![4,5,6]);
items[0].append(&mut items[1]);
}
But it has a compile time error due to 2 mutable borrows:
| items[0].append(&mut items[1]);
| ----- ------ ^^^^^ second mutable borrow occurs here
| | |
| | first borrow later used by call
| first mutable borrow occurs here
I could solve it with Box / Option, but I wonder whether there are better ways to solve this?
My solution with Box:
fn solution_with_box() {
let mut items = Vec::new();
items.push(Box::new(vec![1,2,3]));
items.push(Box::new(vec![4,5,6]));
let mut second = items[1].clone();
items[0].as_mut().append(second.as_mut());
}
My solution with Option:
fn solution_with_option() {
let mut items = vec::new();
items.push(some(vec![1,2,3]));
items.push(some(vec![4,5,6]));
let mut second = items[1].take();
items[0].as_mut().unwrap().append(second.as_mut().unwrap());
}

You can clone the data of items[1] as follows:
fn main() {
let mut items = Vec::new();
items.push(vec![1,2,3]);
items.push(vec![4,5,6]);
let mut a: Vec<i32> = items[1].clone();
&items[0].append(&mut a);
}
If you don't want to clone the data, you can use mem::take as suggested by #trentcl
fn main() {
let mut items = Vec::new();
items.push(vec![1,2,3]);
items.push(vec![4,5,6]);
let second = std::mem::take(&mut items[1]);
items[0].extend(second);
println!("{:?}", items);
}

This is not fastest way of doing it but it solves your problem.
fn problem() {
let mut items = Vec::new();
items.push(vec![1,2,3]);
items.push(vec![4,5,6]);
//we can not have two mutable references in the same scope
// items[0].append(&mut items[1]);
// instead you can flatten vector
let first = items.into_iter().flatten().collect(); // we consume items so its no longer available
let items = vec![first, vec![]];
println!("{:?}", items); // [[1,2,3,4,5,6], []]
}

You can use split_at_mut on a slice or vector to get mutable references to non-overlapping parts that can't interfere with each other, so that you can mutate the first inner vector and the second inner vector at the same time.
let mut items = Vec::new();
items.push(vec![1,2,3]);
items.push(vec![4,5,6]);
let (contains_first, contains_second) = items.split_at_mut(1);
contains_first[0].append(&mut contains_second[0]);
dbg!(items);
Rust Playground link
No copying or cloning occurs. Note that contains_second[0] corresponds to items[1] because the second slice split_at_mut returns starts indexing at wherever the split point is (here, 1).

you can solve the problem in two steps:
append the empty vector at the end
remove items[1] and append its elements to items[0]
fn problem() {
let mut items = Vec::new();
items.push(vec![1,2,3]);
items.push(vec![4,5,6]);
items.push(vec![0;0]);
let v = items.remove(1);
items[0].extend(v);
}

Reverse specific key when sorting with multiple keys

When sorting with multiple keys, how can I reverse the order of an individual key? For example:
vec.sort_by_key(|k| (foo(k).reverse(), bar(k)));

You can use sort_by paired with Ordering::reverse instead of sort_by_key.
use std::cmp::Ordering;
#[derive(Debug)]
struct Foo(&'static str, u8);
impl Foo {
fn name(&self) -> &str { self.0 }
fn len(&self) -> u8 { self.1 }
}
fn main() {
let mut vec = vec![Foo("alpha", 1), Foo("beta", 2), Foo("beta", 1)];
vec.sort_by(|a, b| {
match a.name().cmp(b.name()).reverse() {
Ordering::Equal => a.len().cmp(&b.len()),
other => other,
}
});
println!("{:?}", vec);
}
This sorts in reverse alphabetical order, then ties are sorted in ascending numerical order:
[Foo("beta", 1), Foo("beta", 2), Foo("alpha", 1)]
Since Rust 1.17 (via RFC 1677), you can write it like this:
vec.sort_by(|a, b| {
a.name().cmp(b.name()).reverse()
.then(a.len().cmp(&b.len()))
});
If you have something that can naturally be negated / inverted, you can simply negate the key.

Since Rust 1.19, the std::cmp::Reverse struct wraps a value and implements PartialOrd and Ord by calling partial_cmp and cmp with swapped arguments in order to return the reversed order. Just wrap the key to sort in descending order:
vec.sort_by_key(|k| (Reverse(foo(k)), bar(k)));
Before Rust 1.19, you can use the revord crate (documentation) which provides the struct RevOrd which provides the same benefit::
vec.sort_by_key(|k| (RevOrd(foo(k)), bar(k)));

Here's a similar approach to the problem: create a function for chaining multiple orderings:
/// chain two orderings: the first one gets more priority
fn chain_ordering(o1: Ordering, o2: Ordering) -> Ordering {
match o1 {
Ordering::Equal => o2,
_ => o1,
}
}
Then use sort_by, possibly with pattern matching, to produce the ordering of each key:
#[derive(Debug, PartialEq)]
struct HeroSkill(&'static str, &'static str);
fn main() {
// a vector of hero names and super powers
let mut v = vec![
HeroSkill("Bob", "X"),
HeroSkill("Bob", "Y"),
HeroSkill("Alice", "X")
];
// sort by name, then by super power, where Y is more powerful than X
v.sort_by(|&HeroSkill(name1, power1), &HeroSkill(name2, power2)| {
chain_ordering(name1.cmp(name2), power1.cmp(power2).reverse())
});
assert_eq!(v, vec![
HeroSkill("Alice", "X"),
HeroSkill("Bob", "Y"),
HeroSkill("Bob", "X")
]);
}
Playground

Cannot change one value of an enum because it's a re-assignment of an immutable variable

I have enums that contain variables:
enum Asymmetric {
One(i32),
Two(i32, i32),
}
I want to change just one field of an already existing enum, without reassigning the entire enum. My code (playground):
// Does not compile
fn main() {
let two = Asymmetric::Two(4, 5);
let mut vec = vec![two];
foo(&mut vec[0]);
}
fn foo(baa: &mut Asymmetric) {
match baa {
&mut Asymmetric::Two(x0, x1) => {
x0 = 6;
}
_ => {}
}
}
This results in this error:
error[E0384]: re-assignment of immutable variable `x0`
--> src/main.rs:16:13
|
15 | &mut Asymmetric::Two(x0, x1) => {
| -- first assignment to `x0`
16 | x0 = 6;
| ^^^^^^ re-assignment of immutable variable

Thanks to "match ergonomics" (introduced in Rust 1.26, proposed here), you can write your code like this:
fn foo(baa: &mut Asymmetric) {
match baa {
Asymmetric::Two(x0, _) => {
*x0 = 6;
}
_ => {}
}
}
Since baa is a mutable reference, but your pattern you're matching against (Asymmetric::Two(x0, _)) is not, the name x0 is automatically bound as mutable reference.
You can also do it manually by using ref mut. See this working code (playground):
fn foo(baa: &mut Asymmetric) {
match *baa {
Asymmetric::Two(ref mut x0, _) => {
*x0 = 6;
}
_ => {}
}
}
Some minor changes that are not related to your error, but which increase your code quality:
usually you deref (with *) the matched-on value instead of adding & or &mut to every pattern in the match
you should use _ as a name placeholder if you don't need to bind to that name
In your case, you can simplify the code even further by using if let. Whenever you are only interested in one match-case, you should use if let instead:
fn foo(baa: &mut Asymmetric) {
if let Asymmetric::Two(x0, _) = baa {
*x0 = 6;
}
}

How do I calculate the delta (inserted/deleted/moved indexes) of two lists?

Assuming I have two lists of objects that have unique ids and an attribute that determines their order, how can I efficiently get the delta indexes (which indexes were inserted, which were deleted, and which were moved)?
Example of input:
let before: [(id: String, timestamp: String)] = [
("A", "2015-06-04T12:38:09Z"),
("B", "2015-06-04T10:12:45Z"),
("C", "2015-06-04T08:39:55Z"),
("D", "2015-06-03T23:58:32Z"),
("E", "2015-06-01T00:05:51Z"),
]
let after: [(id: String, timestamp: String)] = [
("F", "2015-06-04T16:13:01Z"),
("C", "2015-06-04T15:10:29Z"),
("A", "2015-06-04T12:38:09Z"),
("B", "2015-06-04T10:12:45Z"),
]
let delta = deltaFn(before, after)
Here's the above visualized:
BEFORE AFTER
+-------+----+----------------------+ +-------+----+----------------------+
| index | id | timestamp | | index | id | timestamp |
+-------+----+----------------------+ +-------+----+----------------------+
| 0 | A | 2015-06-04T12:38:09Z | | 0 | F | 2015-06-04T16:13:01Z |
| 1 | B | 2015-06-04T10:12:45Z | | 1 | C | 2015-06-04T15:10:29Z |
| 2 | C | 2015-06-04T08:39:55Z | | 2 | A | 2015-06-04T12:38:09Z |
| 3 | D | 2015-06-03T23:58:32Z | | 3 | B | 2015-06-04T10:12:45Z |
| 4 | E | 2015-06-01T00:05:51Z | | - | | |
+-------+----+----------------------+ +-------+----+----------------------+
Expected result (delta):
Inserted indexes: [0]
Deleted indexes: [3, 4]
Moved indexes: [(from: 0, to: 2), (from: 1, to: 3), (from: 2, to: 1)]

It can be solved by using 2 maps, that map from the ID of each element to its index, and comparing them.
Time complexity is O(n) for hash maps and O(nlogn) for tree based maps.
Pseudo code:
map1 = empty map
map2 = empty map
for each element x with index i in before:
map1.insert(x,i)
for each element x with index i in after:
map2.insert(x,i)
//find moved and deleted:
for each key x in map1:
id1 = map1.get(x)
id2 = map2.get(x)
if id2 == nil:
add id1 to "deleted indexes"
else if id1 != id2:
add (id1,id2) to "moved indexes"
map2.delete(x)
//find new indexes:
for each key x in map2:
add map2.get(x) to "inserted indexes"
Edit: (Suggested in comments)
You can minimize memory output to O(min{m,n}) and time in case of tree-based map to O(max{m,n}log(min{m,n})), where m,n are the sizes of the two lists, by mapping only the smallest list, and then iterating the array (which was not mapped) rather than the map.
map = empty map
for each element x with index i in smaller list:
map.insert(x,i)
for each element x with index i1 in larger list:
i2 = map.get(x)
if i2:
if i1 != i2:
add (i2, i1) to "moved indexes" if smaller list is before
add (i1, i2) to "moved indexes" if smaller list is after
map.delete(x)
else:
add i1 to "inserted indexes" if smaller list is before
add i1 to "deleted indexes" if smaller list is after
// Find new indexes:
for each key x in map:
add map.get(x) to "deleted indexes" if smaller list is before
add map.get(x) to "inserted indexes" if smaller list is after

A possible solution (similar to #amit's answer, but using only a single
map):
// A dictionary mapping each id to a pair
// ( oldIndex, newIndex )
// where oldIndex = -1 for inserted elements
// and newIndex = -1 for deleted elements.
var map : [ String : (from: Int, to: Int)] = [:]
// Add [ id : (from, -1) ] for each id in before:
for (idx, elem) in enumerate(before) {
map[elem.id] = (from: idx, to: -1)
}
// Update [ id : (from, to) ] or add [ id : (-1, to) ] for each id in after:
for (idx, elem) in enumerate(after) {
if (map[elem.id]?.to = idx) == nil {
map[elem.id] = (from: -1, to: idx)
}
}
var insertedIndices : [Int] = []
var deletedIndices : [Int] = []
var movedIndices : [(from: Int, to: Int)] = []
// Compare from: and to: index for each dictionary value:
for pair in map.values {
switch pair {
case (let fromIdx, -1):
deletedIndices.append(fromIdx)
case (-1, let toIdx):
insertedIndices.append(toIdx)
default:
movedIndices.append(pair)
}
}
println(insertedIndices) // [0]
println(deletedIndices) // [3, 4]
println(movedIndices) // [(1, 3), (0, 2), (2, 1)]
Alternatively, use optionals to indicate the absence of old or new index, as suggested by #doisk:
// A dictionary mapping each id to a pair
// ( oldIndex, newIndex )
// where oldIndex = nil for inserted elements
// and newIndex = nil for deleted elements.
var map : [ String : (from: Int?, to: Int?)] = [:]
// Add [ id : (from, nil) ] for each id in before:
for (idx, elem) in enumerate(before) {
map[elem.id] = (from: idx, to: nil)
}
// Update [ id : (from, to) ] or add [ id : (nil, to) ] for each id in after:
for (idx, elem) in enumerate(after) {
map[elem.id] = (map[elem.id]?.from, idx)
}
// Compare:
var insertedIndices : [Int] = []
var deletedIndices : [Int] = []
var movedIndices : [(from: Int, to: Int)] = []
for pair in map.values {
switch pair {
case (let .Some(fromIdx), let .Some(toIdx)):
movedIndices.append(from: fromIdx, to: toIdx)
case (let .Some(fromIdx), .None):
deletedIndices.append(fromIdx)
case (.None, let .Some(toIdx)):
insertedIndices.append(toIdx)
default:
fatalError("Oops") // This should not happen!
}
}

My solution does not use the map function. Computational complexity is O(n * m) where n: elms in before and m: elms in after.
And I am afraid this is not the best solutions available... however here it is :)
import Foundation
// Elm class that contains id and timestamp and is Equatable
class Elm {
let id : String
let timestamp : String
init(tuple : (id:String, timestamp:String)) {
self.id = tuple.id
self.timestamp = tuple.timestamp
}
}
func ==(lhs: Elm, rhs: Elm) -> Bool {
return lhs.id == rhs.id
}
extension Elm : Equatable {}
// data
let before: [Elm] = [
Elm(tuple: ("A", "2015-06-04T12:38:09Z")),
Elm(tuple: ("B", "2015-06-04T10:12:45Z")),
Elm(tuple: ("C", "2015-06-04T08:39:55Z")),
Elm(tuple: ("D", "2015-06-03T23:58:32Z")),
Elm(tuple: ("E", "2015-06-01T00:05:51Z"))
]
let after: [Elm] = [
Elm(tuple: ("F", "2015-06-04T16:13:01Z")),
Elm(tuple: ("C", "2015-06-04T15:10:29Z")),
Elm(tuple: ("A", "2015-06-04T12:38:09Z")),
Elm(tuple: ("B", "2015-06-04T10:12:45Z"))
]
// O(m * n)
func inserted(before:[Elm], after:[Elm]) -> [Int] {
var inserted = [Int]()
for (index, elm) in enumerate(after) {
if !contains(before, elm) {
inserted.append(index)
}
}
return inserted
}
// O(n * m)
func deleted(before:[Elm], after:[Elm]) -> [Int] {
var deleted = [Int]()
for (index, elm) in enumerate(before) {
if !contains(after, elm) {
deleted.append(index)
}
}
return deleted
}
// O(n * m)
func moved(before:[Elm], after:[Elm]) -> [Int:Int] {
var moved = [Int:Int]()
for (index, elm) in enumerate(before) {
if contains(after, elm) && (after[index] != before[index]) {
moved[index] = find(after, elm)
}
}
return moved
}
inserted(before, after)
deleted(before, after)
moved(before, after)

Here's what mine would look like:
func deltaFn(before: [(id: String, timestamp: String)], after: [(id: String, timestamp: String)] ) -> ([Int], [Int], [String]) {
// Get arrays of just the ids...
let beforeIds = before.map { $0.id }
let afterIds = after.map { $0.id }
// Get the inserted and moved indexes...
let (inserted, moved) = reduce(0..<afterIds.count, (inserted: [Int](), moved: [String]())) {
(var changes, index) -> ([Int], [String]) in
if let beforeIndex = find(beforeIds, afterIds[index]) {
if beforeIndex != index {
changes.moved.append("(from: \(beforeIndex), to: \(index))")
}
} else {
changes.inserted.append(index)
}
return changes
}
// Get the deleted indexes...
let deleted = reduce(0..<beforeIds.count, [Int]()) { deleted, index in
return contains(afterIds, beforeIds[index])
? deleted
: deleted + [index]
}
// Return them all as a tuple...
return (inserted, deleted, moved)
}
let (inserted, deleted, moved) = deltaFn(before, after)
println("Inserted: \(inserted)") // Inserted: [0]
println("Deleted: \(deleted)") // Deleted: [3, 4]
println("Moved: \(moved)") // Moved: [(from: 2, to: 1), (from: 0, to: 2), (from: 1, to: 3)]
It works as expected, and is relatively easy on the eyes.
Note that the syntax of the calls to reduce will be different if you are using Swift 2.0. For example,
reduce(0..<afterIds.count, (inserted: [Int](), moved: [String]()))
becomes...
(0..<afterIds.count).reduce((inserted: [Int](), moved: [String]()))

Here's what I managed:
var map: [String : (bef: Int?, aft: Int?)] = [:]
for (idx, (bef, aft)) in zipWithPadding(before, after).enumerate()
where bef?.id != aft?.id {
bef.map{map[$0.id] = (idx, map[$0.id]?.aft)}
aft.map{map[$0.id] = (map[$0.id]?.bef, idx)}
}
for (val, id) in map {
switch id {
case (_, nil): print("\(val): del at \(id.bef!)")
case (nil, _): print("\(val): ins at \(id.aft!)")
default: print("\(val): mov from \(id.bef!) to \(id.aft!)")
}
}
//D: del at 3
//E: del at 4
//F: ins at 0
//B: mov from 1 to 3
//A: mov from 0 to 2
//C: mov from 2 to 1
This method is pretty much the same as the other map answers, except it has one fewer loop, and it skips values that are the same in each array. The map here is a dictionary of Strings (the ids in your array) and tuples. The tuples are Ints, corresponding to the index of a given id in your first array, and the index of that same id in the second. The Ints are optional: this is how we figure out what happened to each id. If the first is nil, and the second is not, then that id was inserted. If the second is nil, though, it was removed. And if both Ints are not nil, then that id has been moved from the first to the second.
The way to fill up the map is by looping through the output of the zipWithPadding function, which is here:
func zipWithPadding <
S0: SequenceType, S1: SequenceType, E0, E1 where
S0.Generator.Element == E0, S1.Generator.Element == E1
> (s0: S0, _ s1: S1) -> AnyGenerator<(E0?, E1?)> {
var (g0, g1) :
(S0.Generator?, S1.Generator?) =
(s0.generate(), s1.generate())
return anyGenerator {
let e0: E0? = g0?.next() ?? {g0 = nil; return nil}()
let e1: E1? = g1?.next() ?? {g1 = nil; return nil}()
return (e0 != nil || e1 != nil) ? (e0, e1) : nil
}
}
I got it from here. The reason you can't use the standard library zip is that it would terminate once either of the underlying sequences did. Here, though, the sequences are of different length. This zip function returns a generator of tuples of successive elements from its two sequence arguments. If either sequence finishes before the other, the subsequent tuples returned will have that sequence's value as nil. Here's an example:
Array(zipWithPadding([1, 2, 3], [1, 2]))
//[({Some 1}, {Some 1}), ({Some 2}, {Some 2}), ({Some 3}, nil)]
Because generators aren't guaranteed to continually return nil after they've returned nil once (a generator returns nil to signal that it's finished), you can't just keep calling the same generator for your tuple value. That's why the generator itself is set to nil once it returns nil: so that you don't call it anymore.
However, Array generators seem to return nil after the last value anyway. So, if you didn't mind undefined behaviour:
func zipWithPadding <
S0: SequenceType, S1: SequenceType, E0, E1 where
S0.Generator.Element == E0, S1.Generator.Element == E1
> (s0: S0, s1: S1) -> AnyGenerator<(E0?, E1?)> {
var (g0, g1) = (s0.generate(), s1.generate())
return anyGenerator {
let (e0, e1) = (g0.next(), g1.next())
return e0 != nil || e1 != nil ? (e0, e1) : nil
}
}
Once you have your generator to loop through, the rest of the idea is easy. The before and after ids are put into a dictionary, and if they weren't in the dictionary already, the corresponding index in the tuple is set to nil. (map[$0.id]?.aft will return nil if $0.id isn't in the dictionary).
In terms of efficiency, there are a few places I think this approach would work. It seems better that it's using only one loop and not two, but the custom zipWithPadding function adds so much overhead the single loop is actually less efficient than two sequential loops. Similarly, using only one enumerate() seems efficient, but again, the overhead isn't worth it. (it's worth noting that if the two arrays were the same length, the standard library zip would give you a very fast option here)
This method does allow you to skip elements that are the same in both arrays, though, something that you can't do with two for loops. In some quick testing, it looks like arrays with more than a quarter of their elements in the same positions are more quickly handled by this method. This would also speed up the looping through the map afterwards: it would only contain differences, so it would be shorter, and you wouldn't need to check for changes.
UPDATE:
I was trying to figure out how to get rid of some of the overhead, especially with regards to the generator. I made a custom struct:
struct PaddedZipGenerator<G0: GeneratorType, G1: GeneratorType> : GeneratorType {
typealias E0 = G0.Element
typealias E1 = G1.Element
typealias Element = (E0?, E1?)
private var (g0, g1): (G0?, G1?)
mutating func next() -> PaddedZipGenerator.Element? {
let e0: E0? = g0?.next() ?? {g0 = nil; return nil}()
let e1: E1? = g1?.next() ?? {g1 = nil; return nil}()
return (e0 != nil || e1 != nil) ? (e0, e1) : nil
}
}
struct PaddedZip<S0: SequenceType, S1: SequenceType> : SequenceType {
typealias Generator = PaddedZipGenerator<S0.Generator, S1.Generator>
private let (s0, s1): (S0, S1)
func generate() -> PaddedZip.Generator {
return PaddedZipGenerator(g0: s0.generate(), g1: s1.generate())
}
}
func zipWithPadding<S0: SequenceType, S1: SequenceType>(s0: S0, _ s1: S1) -> PaddedZip<S0, S1> {
return PaddedZip(s0: s0, s1: s1)
}
And it seems like it worked! With some basic testing it seems like this zipWithPadding function works quite fast. It seems to work faster than two for loops, even when both lists contain none of the same elements.

Develop Reference

ruby bash windows laravel spring algorithm oracle macos go visual-studio

Print NBA MVP list in rust - sorting

Related

Why does my Rust StackDFS implementation produce incorrect results? [closed]

How to concat vector elements and store it back to vector?

Reverse specific key when sorting with multiple keys

Cannot change one value of an enum because it's a re-assignment of an immutable variable

How do I calculate the delta (inserted/deleted/moved indexes) of two lists?

Categories

Resources