Cookies help us deliver our services. By using our services, you agree to our use of cookies. More information

Difference between revisions of "MAPREDUCE Elite"

From NoSQLZoo
Jump to: navigation, search
 
(85 intermediate revisions by 4 users not shown)
Line 1: Line 1:
<pre class=setup>
+
==Introducing the elite database==
#ENCODING
+
These questions will introduce the "elite" database, which contains data about the video game [https://www.elitedangerous.com/ Elite Dangerous]<br/><br/><br/>
import io
+
There are two collections, <code>commodities</code> and <code>systems</code>.<br/>Inside <code>systems</code> there are nested documents called <code>stations</code>.<br/>
import sys
+
A <b>system</b> has many <b>stations</b>, and a <b>station</b> has many trade <code>listings</code>.<br/><br/>
sys.stdout = io.TextIOWrapper(sys.stdout.buffer, encoding='utf-16')
+
Keys used in this database.
#MONGO
+
<pre>
from pymongo import MongoClient
+
    commodities:
client = MongoClient()
+
        _id, average_price, category, name
client.progzoo.authenticate('scott','tiger')
+
    systems:
db = client['progzoo']
+
        _id, allegiance, faction, government, name, population, primary_economy, security, state, stations, updated_at, x, y, z
#PRETTY
+
    systems.stations:
import pprint
+
        _id, allegiance, distance_to_star, economies, export_commodities,has_blackmarket, has_commodities, has_rearm, has_repair,
pp = pprint.PrettyPrinter(indent=4)
+
        has_shipyard, has_outfitting, faction, government, listings, max_landing_pad, name, state, type, updated_at
 +
    systems.stations.listings:
 +
        _id, buy_price, collected_at, demand, commodity, sell_price, supply, update_count
 
</pre>
 
</pre>
==Introducing the elite database **WORK IN PROGRESS==
+
Read more about the structure here: [[Elite Document Structure]]
These questions will introduce the "elite" database, which contains data about the video game [https://www.elitedangerous.com/ Elite Dangerous]<br/><br/>
 
There are two collections, <code>commodities</code> and <code>systems</code>. Inside <code>systems</code> there is are nested documents called <code>stations</code><br/>
 
A <b>system</b> has many <b>stations</b>, and a <b>station</b> imports,exports, and bans many <b>commodities</b>
 
 
<br/><br/>
 
<br/><br/>
==Document Structure==
+
 
Unlike the previous examples, here we'll be using nested documents. A nested document is simply a document that contains other documents, for example:
+
==Questions==
 +
<div class="q" data-lang="mongo" data-switches='elite'>The <code>commodities</code> collection contains the <code>name</code> and <code>average_price</code> of each commodity.<br/>
 +
There are 99 unique commodities and 15 categories.
 +
<p class="strong">Find the average price of each category, round to the nearest whole number.</p>
 +
<pre class="def"><nowiki>
 +
db.commodities.mapReduce(
 +
  function(){
 +
    emit(1, 1);
 +
  },
 +
  function(k, v){
 +
    return Array.sum(v);
 +
  },
 +
  {out: {inline: 1}}
 +
);</nowiki></pre>
 +
<pre class="ans"><nowiki>db.commodities.mapReduce(function(){emit(this.category,this.average_price);},function(k,v){return Math.round(Array.sum(v)/v.length);},{out:{inline:1}});</nowiki></pre>
 +
</div>
 +
<div class="q" data-lang="mongo" data-switches='elite'>Each system has an <code>allegiance</code>. There are three main factions: <b>The Federation</b>, <b>The Empire</b>, and <b>The Alliance</b>.<br/>
 +
<p>Non-populated systems without stations do not have an allegiance, and should be ignored.</p>
 +
<p class="strong">Show the amount of systems following each type of allegiance.</p>
 +
<pre class="def"><nowiki>
 +
db.systems.mapReduce(
 +
  function(){
 +
    emit(1, 1);
 +
  },
 +
  function(k, v){
 +
    return Array.sum(v);
 +
  },
 +
  {out: {inline: 1}}
 +
);</nowiki></pre>
 +
<pre class="ans"><nowiki>db.systems.mapReduce(function(){if (this.allegiance!=null){emit(this.allegiance,1);}},function(k,v){return Array.sum(v);},{out:{inline:1}});</nowiki></pre>
 +
</div>
 +
<div class="q" data-lang="mongo" data-switches='elite'>
 +
<p class="strong">What are the populations of the three main factions?</p>
 +
<div class="hint" title="Three main factions">["Alliance","Federation","Empire"]</div>
 +
<div class="hint" title="NaN?">Some systems are not populated and will have '''null''' population fields, make sure to exclude them using <code>!isNaN()</code>.</div>
 +
<pre class="def"><nowiki>
 +
db.systems.mapReduce(
 +
  function(){
 +
    emit(1, 1);
 +
  },
 +
  function(k, v){
 +
    return Array.sum(v);
 +
  },
 +
  {out: {inline: 1}}
 +
);</nowiki></pre>
 +
<pre class="ans"><nowiki>db.systems.mapReduce(function(){if(!isNaN(this.population)&&this.allegiance!=null&&this.allegiance!="Independent"&&this.allegiance!="Anarchy"){emit(this.allegiance,this.population);}},function(k,v){return Array.sum(v);},{out:{inline:1}});</nowiki></pre>
 +
</div>
 +
 
 +
==Harder Questions==
 +
<div class="q" data-lang="mongo" data-switches='elite'>
 +
<p class="strong">
 +
How much Hydrogen Fuel is owned by each faction?  Limit your query to the first 5000 stations.
 +
</p>
 +
<div class="hint" title="hint">
 +
The amount of stations in a system <b>and</b> the amount of listings to a station aren't fixed. <code>query</code> can be used to ensure that they exist.
 +
</div>
 +
<pre class="def"><nowiki>
 +
db.systems.mapReduce(
 +
  function(){
 +
    emit(1, 1);
 +
  },
 +
  function(k, v){
 +
    return Array.sum(v);
 +
  },
 +
  {out: {inline: 1}}
 +
);</nowiki></pre>
 +
<pre class="ans"><nowiki>db.systems.mapReduce(function(){if(this.stations)for(let i=0;i<this.stations.length;i++){let t=this.stations[i];if(t.listings&&t.allegiance)for(let s=0;s<t.listings.length;s++){let n=t.listings[s];"Hydrogen Fuel"===n.commodity&&emit(t.allegiance,n.supply)}}},function(i,t){return Array.sum(t)},{out:{inline:1},limit:5e3});</nowiki></pre>
 +
</div>
 +
<div class="q" data-lang="mongo" data-switches='elite'>A <code>power_control_faction</code> or <b>Power</b> is an individual or organisation who is in control of a system.<br/>
 +
These powers have allegiance to a faction, but the systems they control do not nescessarily have the same allegiance that they do.
 +
<div class="hint" title="Example"> At the time of writing <b>Zemina Torval</b> is allied with the <b>Empire</b> and controls <b>47</b> systems.<br/>
 
<pre>
 
<pre>
[{"name":"Jim",
+
    {   '_id': 'Zemina Torval',
   "role":"Dad",
+
        'value': {   'Alliance': 0.0,
  "age": 42,
+
                    'Anarchy': 0.0,
  "children":[{"name":"Bob", "age":2},
+
                    'Empire': 39.0,
              {"name":"Alice", "age":3}
+
                    'Federation': 3.0,
            ]
+
                    'Independent': 5.0}}]
}]      
 
 
</pre>
 
</pre>
Another change from the previous example is that <code>null</code> and empty values are no longer stored.<br/>
+
</div>
This means that some documents will have fields that others do not. Some systems will be uninhabited and have no stations. Some stations will have no listings.<br/>
+
<p class="strong">Show the allegiance of each of the power's systems</p>
To query null <b>or</b> non-existant fields we use <code><field>: null</code><br/><br/>
+
<pre class="def"><nowiki>
The average <code>commodities</code> document looks something like this.
+
db.systems.mapReduce(
 +
  function(){
 +
    emit(1, 1);
 +
  },
 +
  function(k,v){
 +
    return Array.sum(v);
 +
  },
 +
  {
 +
    query: {"power_control_faction": {"$exists": 1}},
 +
    out: {inline: 1}
 +
  }
 +
);</nowiki></pre>
 +
<pre class="ans"><nowiki>db.systems.mapReduce(function(){emit(this.power_control_faction,{[this.allegiance]:1});},function(_,v){let a={"Alliance":0,"Anarchy":0,"Empire":0,"Federation":0,"Independent":0};for(let i=0;i<v.length;i++){let b=v[i];a.Alliance+=b.Alliance||0;a.Anarchy+=b.Anarchy||0;a.Empire+=b.Empire||0;a.Federation+=b.Federation||0;a.Independent+=b.Independent||0;}return a;},{out:{"inline":1},query:{"power_control_faction":{"$exists":1}},sort:{"_id":1}});</nowiki></pre>
 +
</div>
 +
<div class="q" data-lang="mongo" data-switches='elite'>Our dataset doesn't contain the allegiance of a power:
 +
<p class="strong">Using the result from the previous question, guess the power's allegiance by the faction that the majority of their systems follow.</p>
 +
<p>To achieve this, you'll need to use the <code>finalize: function(k, v){}</code> in the third argument to find the key with the largest value.</p>
 +
<div class="hint" title="Example">
 
<pre>
 
<pre>
 
{
 
{
        "_id" : ObjectId("55af74e7402aa43f1ce7e3a3"),
+
    "_id" : "Zemina Torval",
        "name" : "Explosives",
+
    "value" : "Empire"
        "average_price" : 267,
 
        "category" : "Chemicals"
 
 
}
 
}
</pre>
+
</pre></div>
<code>systems</code> is much bigger, <div class=hint title="click here to show it.">
+
<pre class="def"><nowiki>
<pre>
+
db.systems.mapReduce(
{
+
  function(){
        "_id" : ObjectId("55b0cef2369fd55b7d4489ed"),
+
    emit(1, 1);
        "stations" : [
+
  },
                {
+
  function(k, v){
                        "max_landing_pad_size" : "M",
+
    return Array.sum(v);
                        "has_blackmarket" : 0,
+
  },
                        "has_commodities" : 1,
+
  {
                        "updated_at" : 1434929486,
+
    finally: function(k, v){
                        "has_outfitting" : 0,
+
      return v;
                        "government" : "Democracy",
+
    },
                        "has_shipyard" : 0,
+
    query: {"power_control_faction": {"$exists": 1}},
                        "type" : "Unknown Outpost",
+
    out: {inline: 1}
                        "faction" : "",
+
  }
                        "has_rearm" : 0,
+
);</nowiki></pre>
                        "allegiance" : "Federation",
+
<pre class="ans"><nowiki>db.systems.mapReduce(function(){emit(this.power_control_faction,{[this.allegiance]:1});},function(k,v){let a={"Alliance":0,"Anarchy":0,"Empire":0,"Federation":0,"Independent":0};for(let i=0;i<v.length;i++){let b=v[i];a.Alliance+=b.Alliance||0;a.Anarchy+=b.Anarchy||0;a.Empire+=b.Empire||0;a.Federation+=b.Federation||0;a.Independent+=b.Independent||0;}return a;},{finalize:function(k,v){return Object.keys(v).reduce((a,b)=>v[a]>v[b]?a:b);},out:{"inline":1},query:{"power_control_faction":{"$exists":1}},sort:{"_id":1}});</nowiki></pre>
                        "has_refuel" : 1,
 
                        "name" : "Kinsey Ring",
 
                        "listings" : [
 
                                {
 
                                        "commodity" : "Hydrogen Fuel",
 
                                        "supply" : 129630,
 
                                        "collected_at" : 1421669319,
 
                                        "update_count" : "1",
 
                                        "buy_price" : 93,
 
                                        "sell_price" : 89,
 
                                        "demand" : 0
 
                                },
 
                                {
 
                                        "commodity" : "Mineral Oil",
 
                                        "supply" : 0,
 
                                        "collected_at" : 1421669320,
 
                                        "update_count" : "1",
 
                                        "buy_price" : 0,
 
                                        "sell_price" : 326,
 
                                        "demand" : 217674
 
                                },
 
                        ],
 
                        "distance_to_star" : 2359,
 
                        "economies" : [
 
                                "Industrial",
 
                                "Refinery"
 
                        ],
 
                        "has_repair" : 1
 
                },
 
                {
 
                        "max_landing_pad_size" : "L",
 
                        "has_blackmarket" : 1,
 
                        "has_commodities" : 0,
 
                        "updated_at" : 1434929486,
 
                        "has_outfitting" : 1,
 
                        "government" : "Democracy",
 
                        "has_shipyard" : 1,
 
                        "type" : "Unknown Starport",
 
                        "faction" : "",
 
                        "has_rearm" : 1,
 
                        "allegiance" : "Federation",
 
                        "has_refuel" : 1,
 
                        "name" : "Wohler Port",
 
                        "distance_to_star" : 3520,
 
                        "economies" : [
 
                                "Industrial",
 
                                "Refinery"
 
                        ],
 
                        "has_repair" : 1
 
                }
 
        ],
 
        "name" : "1 Kappa Cygni",
 
        "faction" : "United 1 Kappa Cygni Future",
 
        "government" : "Democracy",
 
        "allegiance" : "Federation",
 
        "updated_at" : 1430938622,
 
        "state" : "None",
 
        "needs_permit" : 0,
 
        "y" : 37.78125,
 
        "x" : -117.75,
 
        "security" : "High",
 
        "z" : 11.1875,
 
        "primary_economy" : "Industrial",
 
        "population" : 24843190
 
}
 
</pre>
 
Finally, here is a list of all the keys used.
 
<pre>
 
    commodities:
 
        _id, average_price, category, name
 
    systems:
 
        _id, allegiance, faction, government, name, population, primary_economy, security, state, stations, updated_at, x, y, z
 
 
 
    systems.stations:
 
        allegiance, distance_to_star, economies, export_commodities,has_blackmarket, has_commodities, has_rearm, has_repair,
 
        has_shipyard, has_outfitting, faction, government, listings, max_landing_pad, name, state, type, updated_at
 
 
 
    systems.stations.listings:
 
        buy_price, collected_at, demand, commodity, sell_price, supply, update_count
 
       
 
</pre>
 
 
</div>
 
</div>
 +
[https://goo.gl/forms/ep8rBbCQSa381ic82 {{huge| Survey}}] <br/>
 +
Do you have thoughts about this website that you would like to share? Help improve NoSQLZoo!

Latest revision as of 14:48, 17 October 2018

Introducing the elite database

These questions will introduce the "elite" database, which contains data about the video game Elite Dangerous


There are two collections, commodities and systems.
Inside systems there are nested documents called stations.
A system has many stations, and a station has many trade listings.

Keys used in this database.

    commodities:
        _id, average_price, category, name
    systems:
        _id, allegiance, faction, government, name, population, primary_economy, security, state, stations, updated_at, x, y, z
    systems.stations: 
        _id, allegiance, distance_to_star, economies, export_commodities,has_blackmarket, has_commodities, has_rearm, has_repair,
        has_shipyard, has_outfitting, faction, government, listings, max_landing_pad, name, state, type, updated_at
    systems.stations.listings: 
        _id, buy_price, collected_at, demand, commodity, sell_price, supply, update_count

Read more about the structure here: Elite Document Structure

Questions

The commodities collection contains the name and average_price of each commodity.

There are 99 unique commodities and 15 categories.

Find the average price of each category, round to the nearest whole number.

db.commodities.mapReduce(
  function(){
    emit(1, 1);
  },
  function(k, v){
    return Array.sum(v);
  },
  {out: {inline: 1}}
);
db.commodities.mapReduce(function(){emit(this.category,this.average_price);},function(k,v){return Math.round(Array.sum(v)/v.length);},{out:{inline:1}});
Each system has an allegiance. There are three main factions: The Federation, The Empire, and The Alliance.

Non-populated systems without stations do not have an allegiance, and should be ignored.

Show the amount of systems following each type of allegiance.

db.systems.mapReduce(
  function(){
    emit(1, 1);
  },
  function(k, v){
    return Array.sum(v);
  },
  {out: {inline: 1}}
);
db.systems.mapReduce(function(){if (this.allegiance!=null){emit(this.allegiance,1);}},function(k,v){return Array.sum(v);},{out:{inline:1}});

What are the populations of the three main factions?

["Alliance","Federation","Empire"]
Some systems are not populated and will have null population fields, make sure to exclude them using !isNaN().
db.systems.mapReduce(
  function(){
    emit(1, 1);
  },
  function(k, v){
    return Array.sum(v);
  },
  {out: {inline: 1}}
);
db.systems.mapReduce(function(){if(!isNaN(this.population)&&this.allegiance!=null&&this.allegiance!="Independent"&&this.allegiance!="Anarchy"){emit(this.allegiance,this.population);}},function(k,v){return Array.sum(v);},{out:{inline:1}});

Harder Questions

How much Hydrogen Fuel is owned by each faction? Limit your query to the first 5000 stations.

The amount of stations in a system and the amount of listings to a station aren't fixed. query can be used to ensure that they exist.

db.systems.mapReduce(
  function(){
    emit(1, 1);
  },
  function(k, v){
    return Array.sum(v);
  },
  {out: {inline: 1}}
);
db.systems.mapReduce(function(){if(this.stations)for(let i=0;i<this.stations.length;i++){let t=this.stations[i];if(t.listings&&t.allegiance)for(let s=0;s<t.listings.length;s++){let n=t.listings[s];"Hydrogen Fuel"===n.commodity&&emit(t.allegiance,n.supply)}}},function(i,t){return Array.sum(t)},{out:{inline:1},limit:5e3});
A power_control_faction or Power is an individual or organisation who is in control of a system.

These powers have allegiance to a faction, but the systems they control do not nescessarily have the same allegiance that they do.

At the time of writing Zemina Torval is allied with the Empire and controls 47 systems.
    {   '_id': 'Zemina Torval',
        'value': {   'Alliance': 0.0,
                     'Anarchy': 0.0,
                     'Empire': 39.0,
                     'Federation': 3.0,
                     'Independent': 5.0}}]

Show the allegiance of each of the power's systems

db.systems.mapReduce(
  function(){
    emit(1, 1);
  },
  function(k,v){
    return Array.sum(v);
  },
  {
    query: {"power_control_faction": {"$exists": 1}},
    out: {inline: 1}
  }
);
db.systems.mapReduce(function(){emit(this.power_control_faction,{[this.allegiance]:1});},function(_,v){let a={"Alliance":0,"Anarchy":0,"Empire":0,"Federation":0,"Independent":0};for(let i=0;i<v.length;i++){let b=v[i];a.Alliance+=b.Alliance||0;a.Anarchy+=b.Anarchy||0;a.Empire+=b.Empire||0;a.Federation+=b.Federation||0;a.Independent+=b.Independent||0;}return a;},{out:{"inline":1},query:{"power_control_faction":{"$exists":1}},sort:{"_id":1}});
Our dataset doesn't contain the allegiance of a power:

Using the result from the previous question, guess the power's allegiance by the faction that the majority of their systems follow.

To achieve this, you'll need to use the finalize: function(k, v){} in the third argument to find the key with the largest value.

{
    "_id" : "Zemina Torval",
    "value" : "Empire"
}
db.systems.mapReduce(
  function(){
    emit(1, 1);
  },
  function(k, v){
    return Array.sum(v);
  },
  {
    finally: function(k, v){
      return v;
    },
    query: {"power_control_faction": {"$exists": 1}},
    out: {inline: 1}
  }
);
db.systems.mapReduce(function(){emit(this.power_control_faction,{[this.allegiance]:1});},function(k,v){let a={"Alliance":0,"Anarchy":0,"Empire":0,"Federation":0,"Independent":0};for(let i=0;i<v.length;i++){let b=v[i];a.Alliance+=b.Alliance||0;a.Anarchy+=b.Anarchy||0;a.Empire+=b.Empire||0;a.Federation+=b.Federation||0;a.Independent+=b.Independent||0;}return a;},{finalize:function(k,v){return Object.keys(v).reduce((a,b)=>v[a]>v[b]?a:b);},out:{"inline":1},query:{"power_control_faction":{"$exists":1}},sort:{"_id":1}});

Survey
Do you have thoughts about this website that you would like to share? Help improve NoSQLZoo!